THE AMERICAN MATHEMATICAL 


= 
SARL 


Volume 106, Number 1 


P. J. McKenna 


ONTHLY 


Large Torsional Oscillations in Suspension 
Bridges Revisited: Fixing an Old 
Approximation | 


Inverse Conjugacies and Reversing 


January 1999 


1 


Geoffrey R. Goodson 19 
Symmetry Groups 
Dmitry Fuchs More on Paperfolding 27 
Serge Tabachnikov 
L. R. Bragg Trigonometric Integrals and Hadamard 36 
Products 
Richard Blecksmith Cluster Primes 43 
Paul Erdés 
J. L. Selfridge 
NOTES 
John A. Zuehlke Fermat's Last Theorem for Gaussian 49 
Integer Exponents 
M. J. Jamieson On Rational Function Approximations to 50 
Square Roots 
A. J. van der Poorten A Note on Jacobi Symbols and Continued 52 
P. G. Walsh Fractions 
THE EVOLUTION OF... | 
|. G. Bashmakova The Birth of Literal Algebra 57 
G. S. Smirnova 
PROBLEMS AND 67 
SOLUTIONS 
REVIEWS 
Arnold Allen Mathematics: From the Birth of Numbers. 77 
By Jan Gullberg 
Harold P. Boas Editor's Corner 82 
Jeffrey Shallit Handbook of Applied Cryptography. 85 
By Alfred J. Menezes, Paul C. van Oorschot, 
and Scott A. Vanstone 
The Cryptographic Imagination: 85 
Secret Writing from Edgar Poe to the 
| Internet. By Shawn James Rosenheim 
TELEGRAPHIC 89 
REVIEWS 


AN OFFICIAL PUBLICATION OF THE MATHEMATICAL ASSOCIATION OF AMERICA 


NOTICE TO AUTHORS 


The MONTHLY publishes articles, as well as notes and 
other features, about mathematics and the profes- 
sion. Its readers span a broad spectrum of mathe- 
matical interests, and include professional mathe- 
maticians as well as students of mathematics at all 
collegiate levels. Authors are invited to submit articles 
and notes that bring interesting mathematical ideas 
to a wide audience of MONTHLY readers. 


The MONTHLY’s readers expect a high standard of 
exposition; they expect articles to inform, stimulate, 
challenge, enlighten, and even entertain. MONTHLY 
articles are meant to be read, enjoyed, and dis- 
cussed, rather than just archived. Articles may be 
expositions of old or new results, historical or bio- 
graphical essays, speculations or definitive treat- 
ments, broad developments, or explorations of a sin- 
gle application. Novelty and generality are far less 
important than clarity of exposition and broad appeal. 
Appropriate figures, diagrams, and photographs are 
encouraged. 


Notes are short, sharply focussed, and possibly infor- 
mal. They are often gems that provide a new proof of 
an old theorem, a novel presentation of a familiar 
theme, or a lively discussion of a single issue. 


Articles and Notes should be sent to the Editor: 


ROGER A. HORN 

1515 Mineral Square, Room 142 
University of Utah 

Salt Lake City, UT 84112 


Please send your email address and 3 copies of the 
complete manuscript (including all figures with cap- 
tions and lettering), typewritten on only one side of 
the paper. In addition, send one original copy of all 
figures without lettering, drawn carefully in black ink 
on separate sheets of paper. 


Letters to the Editor on any topic are invited; please 
send to the MONTHLY’s Utah office. Comments, criti- 
cisms, and suggestions for making the MONTHLY 
more lively, entertaining, and informative are wel- 
come. 


See the MONTHLY section of MAA Online for current 
information such as contents of issues, descriptive 
summaries of forthcoming articles, tips for authors, 
and preparation of manuscripts in TEX: 


http: // www.maa.org / 
Proposed problems or solutions should be sent to: 


DANIEL ULLMAN, MONTHLY Problems 
Department of Mathematics 

The George Washington University 
2201 G Street, NW, Room 428A 
Washington, DC 20052 


Please send 2 copies of all problems/solutions mate - 
rial, typewritten on only one side of the paper. 


EDITOR: ROGER A. HORN 
monthly@math.utah.edu 


ASSOCIATE EDITORS: 


WILLIAM ADKINS VICTOR KATZ 

DONNA BEERS STEVEN KRANTZ 

HAROLD BOAS JIMMIE LAWSON 

RICHARD BUMBY CATHERINE COLE McGEOCH 


JAMES CASE RICHARD NOWAKOWSKI 
JANE DAY ARNOLD OSTEBEE 
JOHN DUNCAN KAREN PARSHALL 
PETER DUREN EDWARD SCHEINERMAN 
GERALD EDGAR ABE SHENITZER 
JOHN EWING WALTER STROMQUIST 
JOSEPH GALLIAN ALAN TUCKER 
ROBERT GREENE DANIEL ULLMAN 
RICHARD GUY DANIEL VELLEMAN 
PAUL HALMOS ANN WATKINS 
GUERSHON HAREL DOUGLAS WEST 
DAVID HOAGLIN HERBERT WILF 
EDITORIAL ASSISTANTS: 

ARLEE CRAPO 

MEGAN TONKOVICH 


Reprint permission: 
DONALD ALBERS, Director of Publication 


Advertising Correspondence: 
Mr. JOE WATSON, Advertising Manager 


Change of address, missing issues inquiries, and 
other subscription correspondence: 

MAA Service Center 

maahq@maa.org 


All at the address: 


The Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, DC 20036 


Microfilm Editions: University Microfilms International, 
Serial Bid coordinator, 300 North Zeeb Road, Ann 
Arbor, MI 48106. 


The AMERICAN MATHEMATICAL MONTHLY (ISSN 
0002-9890) is published monthly except bimonthly — 
June-July and August-September by the Mathemati- 
cal Association of America at 1529 Eighteenth Street, 
N.W., Washington, DC 20036 and Montpelier, VT. 
Copyrighted by the Mathematical Association of 
America (Incorporated), 1999, including rights to this 
journal issue as a whole and, except where otherwise 
noted, rights to each individual contribution. ‘‘Permis- 
sion to make copies of individual articles, in paper or 
electronic form, including posting on personal and 
class web pages, for educational and scientific use is 
granted without fee provided that copies are not made 
or distributed for profit or commercial advantage and 
that copies bear the following copyright notice: 
[Copyright the Mathematical Association of America 
1999. All rights reserved.] Abstracting, with credit is 
permitted. To copy otherwise or to republish, re- 
quires specific permission of the MAA’s Director of 
Publication and possibly a fee.’’ Second class postage 
paid at Washington, DC, and additional mailing of- 
fices. Postmaster: Send address changes to the 
American Mathematical Monthly, Membership / 
Subscription Department, MAA, 1529 Eighteenth 
Street, N.W., Washington, DC, 20036-1385. 


Large Torsional Oscillations 
in Suspension Bridges Revisited: 
Fixing an Old Approximation 


P. J. McKenna 


1. INTRODUCTION. In Inventions and Technology, Frederic D. Schwartz writes 
“The Tacoma Narrows Bridge collapse is technology’s version of the JFK assassi- 
nation. There’s the grainy black-and-white film endlessly scrutinized frame by 
frame; the reams of expert analysis next to impossible for a layperson to evaluate; 
and, of course, the buffs who are convinced that only they know the real story” [29]. 

He was referring to a “genteel professorial catfight” [28] between some engi- 
neering writers and me about the explanation of the famous destructive large 
amplitude torsional oscillation captured on film on November 7th, 1940, and now a 
staple in physics classes. 

Until the collapse of the Tacoma Narrows bridge, it seems that most suspension 
bridge building had been largely by rule of thumb, without a theoretical underpin- 
ning. As a result, early bridges, including the Golden Gate bridge and the Tacoma 
Narrows bridge, exhibited interesting behaviour: 


1. they were prone to large-scale oscillation, both torsional and purely vertical 

2. occurrence of this large-scale oscillation is dependent on initial conditions 
and can be started by a single gust [25] 

3. large vertical oscillations could rapidly change, virtually instantaneously, to 
torsional oscillation 

4. these bridges exhibit localised travelling wave solutions 

5. for small oscillation, the observed behaviour is almost perfectly linear. 


Good historical sources for these behaviours are [2] and [5]. 

Of course, these behaviours strongly suggest nonlinearity, and in the late 
eighties, following recent and rapid progress in nonlinear analysis of boundary 
value problems that had just been taking place, I and co-workers attempted to give 
a mathematical explanation for these phenomena [13], [16-19]. Our starting point 
was that an unloaded cable cannot be described by the usual Hooke’s law, since it 
resists expansion but not compression. Thus if an unloaded cable is expanded 
downward by a distance u from the unloaded state, the cable should have a 
resisting force ku*, in other words, ku if u is positive, and 0 if u is negative. When 
the cable is loaded, it stretches to a new equilibrium, around which it obeys the 
linear Hooke’s law until deviations from equilibrium become large enough to result 
in loss of tension when the cable approaches the unloaded state. 

Using the tools of nonlinear analysis and numerical investigations, we showed 
that an equation for a nonlinearly supported beam with this type of nonlinearity 
could explain both large amplitude vertical periodic oscillations and the travelling 
wave behaviour. With the usual fervor of the newly converted, I suggested that the 
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nonlinearity induced by the alternate slackening and tightening of the cables could 
also explain the more famous large amplitude torsional oscillations. 

The purpose of this paper is to explore a startling and different possibility. If 
one is interested solely in a simple version of the Tacoma Narrows oscillation, 
namely large amplitude torsional oscillation about equilibrium, then the puzzle 
may have its roots in a simple trigonometry approximation introduced in the 
engineering literature fifty years ago [5], and unwittingly reproduced ever since, for 
example in [1]. 

After the collapse, the initial response of the engineering community, which has 
served well over the years, was to develop a theory of small oscillations and to 
construct bridges so that the oscillations stayed in that range. This theory involved 
many linearising approximations. 

The theory was successful in the practical sense. Newer bridges no longer 
engaged in interesting large amplitude behaviour. And older ones like the Golden 
Gate were soon modified so that they didn’t either. 

Of course, when one makes the small angle and other near-equilibrium assump- 
tions, a theory emerges that cannot explain the large-amplitude motions. Re- 
designing the bridge to remove the offending behaviour is not the same as 
mathematically understanding its cause. 

This is what the distinguished civil engineering writer, the late Mario Salvadori, 
meant when he wrote to me “... having found obvious and effective physical ways 
of avoiding the problem, engineers will not give too much attention to the 
mathematical solution of this fascinating puzzle and [I] am delighted to learn that 
mathematicians like you are interested in it” [22]. 

In this paper, we re-derive a mechanical model for a beam or plate oscillating 
torsionally about equilibrium, and suspended at both sides or ends by cables. We 
show how the ‘small-angle’ linearisations can remove a large class of large-ampli- 
tude nonlinear solutions that can be sustained by extremely small periodic forcing 
terms. 

Based on the original report of the Tacoma Narrows disaster [2], we choose 
appropriate values for the physical constants in the system of differential equa- 
tions. | 

Then we explore the implications and consequences of making, or not making, 
the trigonometry approximation, and show that even if no loss of tension in the 
cables is assumed, the nonlinearities introduced by the geometry of the situation 
are sufficient to explain the large amplitude oscillations seen before the collapse. 
The large amplitude oscillations that can result from small forcing terms exist over 
the right range of frequency and amplitude to match the historical observations. 

The last nonlinear effect that we listed and would like to explain is the virtually 
instantaneous change from vertical to torsional oscillation. In Section 4, we explore 
some of the consequences of assuming that the cables briefly lose tension in a 
transient way. We observe numerically that when purely vertical oscillations are 
large enough to allow the supporting cables to lose tension, and the system is 
subjected to tiny torsional perturbations, the system becomes violently unstable in 
the torsional dimension. 


2. SETTING UP THE MODEL. There is nothing controversial in this section. We 
set up the equations for vertical and torsional motion of a rod, (or plate), 
suspended by springs at both ends, (or sides), and free to move vertically and 
rotate about its center of gravity. In the first subsection, we derive the equations 
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from the geometry of the situation, and in the second, we choose reasonable values 
for the physical constants and the forcing terms. 


2.1 The equations for vertical and torsional oscillations. If a spring with spring 
constant K is extended by a distance y, the potential energy is Ky*/2. If a rod of 
mass m and length 2/ rotates about its center of gravity with angular velocity 0, 
then, its kinetic energy is given by (1/6)ml?(6)* [31, p. 202]. Assume the rod is 
suspended as in Figure 1 by springs that resist expansion with a spring constant K 
at each end. Let y denote the downward distance of the center of gravity of the 
rod from the unloaded state. Let @ denote the angle of the rod from the 
horizontal. Let y* be the positive part of y, that is, y* = max {y, O}. 


nonlinear cable-like springs 


The vertical 
deflection of 
the center of 


gravity y(t) 


the deflection from horizontal 


Figure 1. A cross-section of the bridge as an inflexible rod supported by two springs at each side, giving 
rise to the system of equations (3) and (4) and simplifying to (5) and (6) when there is no loss of tension. 


The potential energy due to gravity is —mgy. The extension is (y — / sin 6)” in 
one spring and (y + /sin 6)” in the other. Actually, this involves another hidden 
small-angle assumption, namely that the cables remain vertical, although the 
lateral motion of the ends of the plate or rod deflects them slightly to the left or 
right. Thus, this model actually overstates slightly the resistance to torsional 
motion. When the cables are of length 100 feet and the lateral motion is about six 
feet, as they were in the Tacoma Narrows bridge, this seems a reasonable 
approximation. | 

Thus, the total potential energy is 


V = (K/2)(((y —Isin 0)* ) + (y +Usin @)*) — mey 
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and the total kinetic energy is 
T = my?/2 + (1/6) ml’6’. 


Let L=T—V and put (d/dt)SL/60) = (SL/80) and (d/dt)(6L/6y) = 
6L/édy. We obtain the equations 


(1/3) ml6 = (K1)cos 6((y — Isin 6)” —(y + /sin 6)” ) (1) 
jy = —-K((y —/sin 0)" +(y +Jsin 6)” ) + mg (2) 


Since the springs are assumed to remain vertical, the force exerted by the spring is 
not in the torsional direction perpendicular to the rod but is at an angle 6 to that 
perpendicular. This is why the additional term cos@ occurs in the torsional 
equation from the springs. 

Simplifying and adding a small viscous damping term 56 to the first equation 
and dy to the second, and adding an external forcing term f(t) (to be determined 
later), to the torsional equation, we end up with the system 


§ = —80 + (3K/ml)cos al(y —Isin 6)” —(y + /sin a) | + f(t) (3) 


j = —dy — (K/m)[(y — J sin 0)" +(y + sin 0)" | +8 (4) 


Finally, if we assume the cables never lose tension so that (y —/sin 6)*= 
(y —/sin 6) and (y + /sin 6)*=(y +/sin 6), we end up with the uncoupled 
equations 


§ = —60 — (6K/m)cos 6 siné + f(t) (5) 
for the torsional motion and 


y= —dy — (2K/m)y +g (6) 
for the vertical motion. 

Now, we are in a position to introduce the error that we believe is key to the - 
failure to understand the large amplitude torsional oscillation in the case of the 
Tacoma Narrows bridge. If we are interested in studying very small oscillations, we 
could call it an approximation, but for large oscillations, it is an error. 

We put sin 6 = 6 and cos 6 = 1. This gets us to the two equations 


§= —60 — (6K/m)6@ + f(t) (7) 
and 
y= —6y —(2K/m)y +8 (8) 


This is the point at which the discussion of torsional oscillation starts in the 
engineering literature. For example, in [26] one finds torsional motion described by 


I(@+ 6a + wa) = fa, (9) 


where a is the torsional angle and fa is an external force of aerodynamic origin. 
In [4], this equation is used to describe the motion of the Tacoma Narrows bridge 
prior to the collapse. Even in recent engineering literature [1], which claims to 
derive “the coupled equations of motion in their most general and nonlinear 
form”, this same mistake is reproduced, although “...one has to expect responses 
of large vibrational amplitudes which necessitates a nonlinear formulation as 
presented in this paper.” 

In the pioneering studies of small oscillations of suspension bridges, sum- 
marised in [5], it is clear why this approximation was made. There was simply no 
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technology available to solve nonlinear equations like (5). There was no choice but 
to linearize the equation and hope the errors introduced were not too big. Indeed, 
experience showed that for very small oscillations, this was reasonable. 

However, today we can solve (5) accurately. Thus we can determine whether 
there is an important difference between the trigonometrically correct model and 
the linear one. But before we do this, we should choose appropriate constants. 
This is the task of the next subsection. 


2.2 Choosing physical constants and external forcing term. Our main source is [2]. 
Since the model is very simple, we do not concern ourselves with very precise 
choice of constants but content ourselves with getting the magnitudes about right. 
The mass of a foot of the bridge was about 5,000 lbs. so we choose the m in our 
equations to be 2,500 kgs. The width of the bridge was about 12 meters so we 
choose / to be 6. 

The bridge would deflect about .5 meters when loaded with 100 kgs. for each 
.3 m. of bridge. Since there are two springs, this gives the equation 2K(.5) = 
100(9.8) for 2Ky = mg, and we choose K = 1000. 

For the external forcing term, we choose f(t) = Asin wt and investigate the 
response of the different equations to this type of periodic forcing. The frequency 
of the motion of the bridge before the collapse was about 12 to 14 cycles per 
minute so we take p between 1.2 to 1.6. 

There does not seem to be much information about the amplitude of the forcing 
term, although there was an attempt to measure it in [26]. Apparently, one can 
measure small oscillations caused by forces that induce oscillations of about 3 deg. 
“It was found necessary to permit only small amplitudes of oscillation to occur, 
(e.g. in torsion 0 < a < +3deg)” [sic]. Thus, we choose A small enough to create 
oscillation of this order of magnitude in the linear model. 

Finally, there is a consensus that the term describing the viscous damping 
should have a coefficient of about .01 [2]. 

We now have a system that, in the absence of forcing, settles down to an 
equilibrium with y = 12.25 and 6 = 0. As long as the oscillation is such that the 
y +/sin6@ remains below zero, i.e., the deflections upward from equilibrium do not 
exceed about 12 meters, the equations remain uncoupled, and we investigate the 
long-term behaviour of the forced pendulum equation (5) versus the linearized 
version (7) with these constants and a variety of initial conditions and small forcing 
terms. 


2.3 What happened at Tacoma Narrows. We recall what happened before the 
collapse. Again, our source is [2]. 

The bridge engaged in vertical oscillations even during construction. “Prior to 
10.00 A.M. on the day of the failure, there were no recorded instances of the oscillation 
being otherwise than with the two cables in phase and with no torsional motion.” 

On November 7, for some time before 10:00 AM, the bridge was engaged in 
what seemed like normal vertical motions, with amplitudes of about 5 feet and a 
frequency of about 38 per minute. The motion was apparently more violent than 
usual, with eight nodes. Then the torsional motion began, as the bridge was being 
observed by Professor F. B. Farquharson [2]: 


...a violent change in the motion was noted. This change appeared to take 
place without any intermediate stages and with such extreme violence that 
the span appeared to be about to roll completely over. 
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The motion, which a moment before had involved a number of waves, (nine 
or ten) [This means a eight-or-nine noded vertical motion], had shifted 
almost instantly {our emphasis] to two [a one-noded torsional motion]. 


At the moment of first observing the main span,..., the motion had a 
frequency of 14 cycles per minute. ...the node was at the center of the main 
span and the structure was subjected to a violent torsional action about this 
point. 

At times, for a short period, the motion changed over to a single wave on 
each cable but still with the cables out of step. This motion, which never 
lasted long seemed to be of slightly greater amplitude than the single-noded 
motion, but of the same frequency. [This motion, of double amplitude of 28 
feet continued for approximately forty-five minutes. ] 


There is no consensus on what caused the sudden change to torsional motion. In 
[23, p. 209], this transition is explained as “some fortuitous condition broke the 
bridge action.” 

It may have been a minor structural failure, enough to jar the bridge in a 
torsional direction. We explore the consequences of just such a single push on the 
two models we compare: the linear and the trigonometric ones. Thus, it is our task 
to look for large-amplitude periodic motions that are primarily torsional with an 
amplitude of about +1 radian, corresponding to a small torsional forcing term 
with a frequency in the neighbourhood of 12 to 14 cycles per minute. 


2.4 A few words about forcing. We have to choose some method to model the 
aerodynamic and other forces acting on the bridge that cause it to go into a 
periodic motion. There is really no way to say precisely what the forces would be 
when a huge structure is oscillating up and down by 28 feet every five seconds. 

According to [26], for a cross-section similar to the Tacoma Narrows bridge in a 
wind tunnel, the aerodynamic forces induced approximately sinusoidal oscillations 
of amplitude of plus or minus three degrees. In all our experiments, we explore the 
response of the system to a sinusoidal forcing term of the form Asin (yt). We then 
choose the value of A in order to induce oscillations of three degrees near 
equilibrium in the linear model. 

In this sense, we have chosen the particular form Asin( mt) as a ‘generic 
oscillatory force, with the right frequency and amplitude. The conclusions would 
be the same for any other oscillatory force of roughly the same magnitude and 
frequency. For example, we repeated some of these experiments for the forcing 
term A(sin( wt))°. The results were qualitatively the same: small forcing could give 
rise to either large or small periodic long-term behaviour, with the eventual 
outcome dependent on the initial conditions. 


? 


3. NUMERICAL EXPERIMENTS: THE EFFECT OF NOT LINEARIZING. We 
now investigate the response of the two equations to various initial conditions and 
forcing terms. Our first equation, with the correct trigonometry is 


§ = —0.016 — (2.4)cos Osin 6+ Asin pt, (10) 
which when linearized becomes 


6 = —0.010 — (2.4)@ + Asin pt. (11) 
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3.1 Long-term behaviour with and without The Error. We start with initial condi- 
tions that mimic a large torsional push. We choose @(0) = 1.2 and 6(0) = 0, and 
start with w= 1.2 and A=0.06. We run the initial value problem for one 
thousand periods to see what the system has settled down to. This is shown in 
Figure 2. 


theta in radians 
° 


to: 0102. 03 04. 05 06.07 0s Oo 1 
The last 16 of 1000 periods of the forcing term 


NN 


The last 16 of 1000 periods of the forcing term 


theta in radians 


Figure 2. The difference in long-term outcomes depending on whether we solve the trigonometrically 
correct equation (10) or the linearized equation (11) with a large torsional push and a small periodic 
forcing term. 


On the top of Figure 2, the linear oscillator has settled to an oscillation of 
approximately +3deg. Since this is true of all experiments we run, we do not 
repeat the picture, but simply remark that in the linear oscillator, the oscillation 
has died off. 

The bottom half of Figure 2 shows that the torsional oscillator of equation (10) 
has settled into a periodic oscillation of large amplitude. The single large push at 
the start of the experiment has induced a permanent large amplitude torsional 
oscillation. This solution represents a torsional oscillation of about one radian, 
with a period of about 5.2, and a vertical amplitude at the sides of about the 
correct amount of 10 meters. The period is a little larger than the range of 
4.25—5.00 reported in the bridge, but it is certainly a reasonable approximation. 

Now we solve the initial value problem for the trigonometric oscillator with the 
initial conditions 9(0) = 0 and 6(0) = 0. This time, it eventually settles down to 
the small periodic oscillation shown in Figure 2. The results of solving the 
trigonometric oscillator for large and small torsional pushes is contrasted in 
Figure 3. If the initial values remain small, one can end up in what is basically the 
linear situation: compare the top graphs in Figures 2 and 3. On the other hand, 
the large torsional push can result in the large amplitude oscillation shown in the 
bottom graphs of Figures 2 and 3. 

This is the phenomenon that we wish to emphasize: the trigonometric oscillator 
can have several different periodic responses to the same periodic forcing term. Which 
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"0 O01 02 03 04 05 06 07 08 09 1 


CN 


Figure 3. Eventual behaviour of the trigonometrically correct oscillator combining a large torsional 
push (bottom) or small (top), both with a small torsional forcing term with p at 1.2. 


response eventually results can be determined by a simple transient event such as a large 
single push. 

Let’s explore this a littke more. We take pw = 1.3. Also, take A = 0.02, enough 
to induce an oscillation in the linear oscillator of +1deg. Needless to say, even 
after the large push, the linear oscillator settles down after a large time to 
near-equilibrium, as seen in Figure 4. Here, the period, matching that of the 
forcing term, is a more realistic 4.83. Again, the large push induces a permanent 
large torsional oscillation in the trigonometric oscillator, of slightly smaller ampli- 
tude than the earlier case, but still huge. 

The same results occur if we take pw = 1.4, where we get a large amplitude 
oscillation with a period of 4.5, corresponding closely to that reported (about 4.3) 
at the start of the torsional oscillation on the bridge. Later, it seems to have slowed 
down to 5.0. 

Similar results were obtained for w = 1.5 but they disappear above this value 
until one approaches values of py that are integer multiples of the values we have 
been discussing. At these values subharmonic behaviour occurs, as we discuss in 
Section 3.2. | 


0 O01 02 03 04 05 06 07 08 09 1 


"0 O01 02 03 04 05 06 07 08 09 1 


Figure 4. Multiple periodic solutions similar to Figure 3, but with w changed to 1.3. 
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0 01 02 03 04 05 06 0.7 08 09 1 


0 01 02 03 04 05 06 07 08 09 1 


Figure 5. Multiple periodic solutions similar to Figure 3, but with » changed to 1.4. 


This is the most important conclusion of this article: over a range of frequency 
that is close to that observed before the collapse of the Tacoma Narrows, the final 
oscillation that results in. the trigonometric oscillator after a large time can be 
either small or very large. All it may take is a single push to change the eventual 
outcome from small near-equilibrium behaviour to large torsional periodic mo- 
tions. 


3.2 Subharmonic responses. Anyone familiar with nonlinear dynamics who has 
read this far doubtless wonders why we are emphasizing responses of the same 
period as the forcing term. In fact, we expect that the periodic solutions of (10) 
discussed in the previous section can be sustained, not just by the forcing term of 
the same period but with one of double or triple the period. 

In Figure 6, we show two periodic solutions corresponding to a forcing term 
Asin pt with pw = 2.6. The solution is similar to that found at uw = 1.3. The figure 
looks a little different because the last sixteen periods of the forcing term are 
shown so the x-scale is half the one shown in Figure 4. 

All the remarks of the previous section apply to solutions that are subharmonic 
responses. 


0 01 02 03 04 05 06 0.7 08 09 1 


0 
=I 
0 O01 02 03 04 05 06 0.7 08 09 1 
Figure 6. A subharmonic response to periodic forcing. Here, with w = 2.6, the large and small initial 


conditions give rise to either a small linear response of the same period or to one of twice the period 
similar to that seen around p = 1.3. 
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The most intriguing values of yw to investigate may be when it is near 4. This is 
because the vertical motions that preceded the torsional ones had a frequency 
(about 40 per minute) that corresponded to approximately this value, strongly 
suggesting the existence of some forcing term of this frequency. | 

Sure enough, when the nonlinear system is forced at this frequency, one obtains 
a periodic response to a large initial condition and a small forcing term of 
approximately the right frequency and magnitude. This is shown in Figure 7, where 
a forcing term Asin 4f¢ is taken, and combined with either a large or small initial 
condition. Varying the initial conditions can result in either small linear responses 
of the right period (with small initial conditions) or large nonlinear responses 
similar to those described earlier, with period three times that of the forcing term. 


—_—_—_L_ 


0 O01 02 03 04 05 06 07 08 09 1 


0 
=] 
0 01 02 03 04 05 06 0.7 08 09 1 


Figure 7. These torsional oscillations corresponds to large and small initial conditions if pw = 4, 
corresponding to the frequency of the earlier vertical oscillations in the bridge, before the onset of 
torsional motion. 


3.3 Some transient results. Now we illustrate how dramatically different the 
transient behaviour is for the trigonometrically correct and the linearized system. 
We solve the initial value problem for a variety of different initial conditions and 
compare the two systems. Throughout this subsection, where a forcing term has 
been used, it is of the form Asin(1.2¢), although similar results would occur 
around mw-= 1.3 or 1-4. 

First, the good news. 

If we solve the initial value problem with no forcing but with large initial 
conditions, the results over the first hundred periods for the two problems are 
shown in Figure 8. There is little real difference, as the damping takes over and 
both systems settle back to equilibrium. 

Now, do the same experiment starting with initial conditions at equilibrium, but 
with a small forcing term A = 0.05. Again, the two systems give close results. This 
is shown in Figure 9. 

So far, so good! When subjected separately either to small periodic forcing or to 
a large transient displacement from equilibrium, the system responds with much 
the same behaviour in the linear and nonlinear models. 

Now to the results that we have already discussed. If we combine the previous 
two effects, the principle of superposition predicts that the linear system will die 
down as before. Figure 10 shows this, as well as the huge difference caused by 
doing the correct trigonometry. The trigonometrically correct model continues 
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Figure 8. The result of a large push from equilibrium, in both the linear and correct models, with no 
forcing term. 


“0 O1 02 03 04 05 06 0.7 08 09 1 


u4 


0.2 ——__-__—-— 
0.1 } 
OF 

—0.1FH | 


~ 020-9702 03 04 05 06 07 08 09 4 


Figure 9. The result of a small forcing term starting at equilibrium. Both systems start to die down 
immediately. 
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Figure 10. The transient behaviour resulting from combining the two influences shown separately in 


Figures 8 and 9. Predictably, the linear model starts to die down but the nonlinear one goes into large 
oscillation. 
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indefinitely in a large torsional oscillation, eventually settling to the large periodic 
motion introduced in Figure 2. 

Finally, we show how sensitive the correct system is to the amplitude of the 
forcing term. Figure 11 shows the effect of changing the amplitude of the forcing 
term from A = 0.05 (top), which results in the large amplitude motion, to A = 0.04 
(bottom), when the motion begins immediately to die down. 
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Figure 11. A slight change in the amplitude of the forcing term can make a huge difference in the 
nonlinear model. The effects of a large push with A = 0.04 and A = 0.05 are compared. The slight 
increase is enough to change the eventual behaviour from small near-equilibrium motion to large and 
destructive motion. 


3.4 A quick summary. Performing the unnatural act of linearizing the trigonomet- - 
ric oscillator of equation (10) has the effect of removing a large class of large 
amplitude behaviours. 

As shown in Figures 8 and 9, the linear model is good in the presence of a large 
transient push and no forcing, or if one starts off at equilibrium with small forcing. 
However, when these two effects are combined, it completely fails to predict the 
eventual long-term behaviour of the real system (10). 

In the trigonometrically correct model, over a wide range of frequency, several 
different amplitude oscillations can exist for the same periodic forcing term. Which 
one the system ends up on can depend on simple transient events, such as a single 
large push. Large amplitude solutions can result as a subharmonic response to a 
higher frequency forcing term and are of roughly the right magnitude and fre- 
quency when compared to the oscillation of the Tacoma Narrows bridge in its final 
torsional oscillations. 

We emphasize that there is nothing high-tech or controversial about these 
results. 

It is well known that a periodically forced pendulum equation such as (5) 
exhibits multiple solution and chaotic behaviour [10]. Perhaps the only surprise is 
that when one makes reasonable guesses for the constants based on the available 
literature, the resulting periodic motions are so good a match to the historical 
data. 

One simply recognizes that trigonometry has been artificially removed from the 
problem, restores it, and uses a good Runge-Kutta initial value solver to find the 
long-time behaviour of the system. 
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There remain two other things to understand: first, what was the origin of the 
original vertical oscillations of up to 5 ft. in amplitude and with a frequency of 
about 40 per minute? If we assume these oscillations were approximately sinu- 
soidal, accelerations must have been approaching that of gravity. 

Second, one can ask about the origin of the sudden transition from vertical to 
torsional motion. On this question, we may need to be more speculative or 
controversial, as we shall see in the next section. However, as regards the large 
amplitude torsional periodic solution, we believe the explanation of this section is 
satisfactory. 


4, WHAT IF THE SPRINGS LOSE TENSION BRIEFLY? THE TRANSITION TO 
TORSIONAL MOTION. So far, we have focussed on the least controversial 
results: namely, if one does not artificially remove the trigonometry from the 
torsional oscillator, one gets a realistic explanation for the oscillations seen in the 
Tacoma Narrows bridge before its collapse. We have largely avoided the question 
of what happens when the two equations (3) and (4) are coupled due to periodic 
slackening of the cables. 

If this does happen, the structure of the periodic solntions becomes consider- 
ably more complex. A start to studying this system and hints of the complexities 
that arise can be found in [12]. 

There is some controversy about this question. I and co-workers claim that with 
accelerations reaching that of gravity, and with magnitudes of up to 28 ft. every 
four seconds, there must have been some slackening of the cables. Some engineers 
insist, to the verge of apoplexy, that this cannot happen [3]. A third group take a 
middle path, claiming that the cables can slacken only in a brief and transitory 
fashion [3]. 

In this final section, we follow the middle path and ask what iappens to the 
initial value problem solution of the full nonlinear system, if the cables briefly lose 
tension due to a vertical motion. | 

Here, we come to a new and unexpected conclusion: a purely vertical nontor- 
stonal motion in which the cables lose tension can become disastrously unstable even in 
the presence of tiny torsional forces, setting up a rapid transition to large amplitude 
torsional motion. This might be the explanation for the sudden transition observed 
at Tacoma Narrows. 

We have no,good mathematical explanation for this phenomenon. However, we 
can demonstrate it in a sequence of mathematical experiments. 

We start with the initial value problem (3) and (4), with a fairly large initial 
impulse in the vertical direction and no torsional oscillation. Figure 12 shows the 
effect of a large push in the vertical direction, y(0) = 26 with tiny torsional forcing 
(A = 0.002). What you get is exactly what you expect: the cables loosen briefly for 
about two periods (falling below y = 0), but the torsional motions are confined to 
about + 0.003, essentially undetectable. 

Now repeat the same experiment, changing only the magnitude of the initial 
push in the vertical direction, to y(0) = 31. The results are shown in Figure 13. 
The vertical y-motion (top) is behaving much as before, but the character of the 
torsional motion has totally changed. The additional vertical push has resulted in a 
completely different torsional oscillation. There is initially almost none, and then, 
virtually instantaneously, it changes to large amplitude torsional oscillations of 
magnitude approaching 1 radian. Since there was essentially no torsional forcing 
term, damping eventually takes over and the motion settles down to the near 
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Figure 12. Vertical (top) and torsional (bottom) responses to a large push in the vertical direction. As 
intuition would suggest, the vertical oscillation is large but settles down quickly. Torsional oscillation 
with only a tiny forcing term is essentially invisible. 
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Figure 13. A somewhat larger initial push in the vertical direction, but with everything else the same as. 
in Figure 12, gives rise to a sudden transition to large amplitude torsional oscillation. 


equilibrium behaviour. Of course, if there had been no torsional forcing, as 
opposed to tiny, there would have been no torsional result. 

Suppose we have no torsional forcing term. Then if 6(0) = 0, the result is pure 
vertical motion. Now, repeat the same experiment with @(0) = 0.001. The result is 
shown in Figure 14: nothing torsional for a while, followed by a sudden jump to 
violent torsional (up to 50 deg), followed by eventual decay to equilibrium as the 
damping takes over. 

We conjecture that this might have been the situation at the instant of 
changeover at Tacoma Narrows from vertical to torsional oscillation, which ap- 
peared to take place almost instantaneously. There was already a large amount of 
vertical kinetic energy built up and a small torsional forcing term could have been 
enough to change the motion. 

The reader probably can guess where we are heading. Suppose we have a small 
amount of periodic torsional forcing, enough to sustain a +3deg motion near 
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Figure 14. No torsional forcing term, but a tiny displacement in the initial value of @(0) is also enough 
to induce violent torsional oscillation, which, in the absence of periodic forcing, eventually dies down 
due to damping. 


equilibrium. We have already seen how a single large torsional push could induce a 
large-amplitude periodic torsional motion. Now we consider the influence of a 
large vertical push combined with the small amount of periodic torsional forcing. 
The result is shown is Figure 15. 

Here we have taken pw as 1.3, in the middle of the range where we expect 
multiple solutions. We have taken A = 0.02, about enough to induce linear 
oscillation of about +2 deg. We have given a sufficiently large push in the vertical 
direction to cause the springs to lose tension for the first five cycles. The result is a 
couple of cycles of small torsion, followed by an almost instantaneous transition to 
huge torsional motion, which remains permanent. It eventually settles down to the 
permanent periodic torsional motion shown in Figure 4. 

_ We believe that we have discovered a convincing explanation for the mystery of 
the sudden transition to torsional motion. A large vertical motion had built up, there 
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Figure 15. What really happened to induce torsional oscillation at Tacoma Narrows? A small but 
periodic torsional force combines with a large vertical transient push to produce a rapid transition to 
large torsional oscillation as the vertical oscillation is damped away. 
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was a small push in the torsional direction to break symmetry, the instability occurred, 
and small aerodynamic torsional periodic forces were sufficient to maintain the large 
periodic torsional motions, as shown in Figure 15. 


5. SOME CONCLUDING COMMENTS. No mathematical model is ever perfect. 
Turing said it best: “This model will be a simplification and an idealization, and 
consequently a falsification. It is to be hoped that the features retained for 
discussion are those of the greatest importance in the present state of knowledge” 
[32]. Let us review some of the short-comings of our paper. | 

Following the engineering literature, we have treated the cable-suspension 
structure as a torsional oscillator supported by springs that remain linear until they 
reach the unloaded state. We doubt that a bridge oscillating up and down by about 
10 meters every 4 seconds obeys Hooke’s law. 

Our model slightly understates the period of the large amplitude torsional 
oscillations. This is probably due to the fact that there is additional resistance to 
torsion from the road-bed, adding an extra spring constant to equation (10). 

Our model says very little about the vertical oscillation that preceded the 
torsional oscillation. It may explain why the vertical motion was so rapidly 
converted to torsional motion, but has little to say about why this original motion 
started and continued. Nor does it say much about the rather mysterious complex 
torsional motion that actually occurred, namely one that alternated between 
one-noded and no-noded oscillations. 

However, it still gives a remarkably “low-tech” explanation of two of the 
phenomena, the large amplitude torsional motions and the transitional motions. 

There remains a great deal to do on the coupled system. With a reliable 
Runge-Kutta solver and unlimited computer time (and some patience), the reader 
can discover new phenomena in the solution set of this system. Certainly, this type 
of experimentation makes for interesting class projects in the undergraduate 
environment. | 

A mathematical explanation for the apparently unstable nature of the large 
amplitude vertical oscillation in which cables lose tension would be desirable. 
Although large amplitude vertical oscillations have been investigated and their 
one-dimensional stability proved [13], we have no proof of their instability in the 
torsional direction. An intuitive argument might be advanced that if the rod is ever 
in the situation where one spring is under tension and the other is not, this 
introduces a new large torsional force. | 

Finally, we are not sure of the consequences of this work for modern suspension 
bridges in earthquake conditions. Part of the dilemma, as one leading bridge 
engineer has lamented, is that there is a “lack of open discussion” on these 
problems [6]. 

It is not clear whether, in their calculations about earthquake responses, 
engineers have taken into account the potentially catastrophic consequences of a 
brief loosening of the cables and the ensuing large amplitude torsional oscillations 
that can result. To judge by the literature, they are still making the small 
oscillation linearization, which can be so misleading once away from equilibrium 
[1]. Since hundreds of millions of dollars are being spent in an effort to strengthen 
suspension bridges in California in preparation for large earthquakes, this question 
may not be entirely academic. 

Finally, it is worth remarking that our results illustrate how the availability of 
inexpensive computation is changing the entire culture of mathematics. Now, new 
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and interesting results can be discovered in the undergraduate classroom. We can 
now investigate numerically simple systems such as (4) and (5), and we uncover 
beautiful new properties that we could not have suspected previously. We may be 
witnessing the dawn of a new golden age of discovery in nonlinear oscillations. 


REFERENCES 


1. 


A. M. Abdel-Ghaffar, Suspension bridge vibration: Continuum formulation, Jour. Eng. Mechanics, 
A.S.C.E. 108 (1982) 1215-1232. 


O. H. Amann, T. von Karman, and G. B. Woodruff, The Failure of the Tacoma Narrows Bridge, 
Federal Works Agency, 1941. 


D. Berreby, The great bridge controversy, Discover 13 (1992) 26-33. 


K. Y. Billah, and R. H. Scanlan, Resonance, Tacoma Narrows bridge failure, and undergraduate 
physics textbooks, Amer. J. Physics 59 (1991) 118-124. 


F. Bleich, C. B. McCullough, R. Rosecrans, and G. S. Vincent, The Mathematical Theory of 
Suspension Bridges. U.S. Dept. of Commerce, Bureau of Public Roads, 1950. 


A. Castellani, Safety Margins. of suspension bridges under seismic conditions, ASCE J. Structural 
Engineering 113 (1987) 1600-1616. 


A. Castellani, and P. A. Felotti, A note on Hae vibration of suspension bridges, ASCE J. 
Structural Engineering 112 (1986) 2169-2173. 


Y. Chen, and P. J. McKenna, Traveling waves in a nonlinearly suspended beam: theoretical results 
and numerical observations, J. Differential Equations, 136 (1997) 325-355. 


Y. S. Choi, K. C. Jen, and P. J. McKenna, The structures of the solution set for periodic 
oscillations in a suspension bridge model, IMA J. Appl. Math. 47 (1991) 283-306. 


P. Blanchard, R. L. Devaney, and G. Hall, Differential Equations, PWS Publishing, Boston, 1996. 


S. H. Doole, and S. J. Hogan, A piecewise linear suspension bridge model: Nonlinear dynamics 
and orbit continuation, Dynamics Stability Systems 11 (1996) 19-47. 


S. H. Doole, and S. J. Hogan, Torsional dynamics in a simple suspension bridge model, Applied 
Nonlinear Mathematics Research Report Number 9.96, University of Bristol, Bristol, 1996. 


J. Glover, A. C. Lazer, and P. J. McKenna, Existence and stability of large-scale nonlinear 
oscillations in suspension bridges, Z. Angew. Math. Phys. 40 (1989) 171-200. 

D. Jacover and P. J. McKenna, Nonlinear torsional flexings in a periodically forced suspended 
beam, J. Comput. Appl. Math. 52 (1994):241-265. 

Theodore von Karman, The Wind and Beyond, Theodore von Karman, Pioneer in Aviation and 
Pathfinder in Space, Little Brown and Co. Boston, 1967. 

A. C. Lazer, and P. J. McKenna, Large amplitude periodic oscillations in suspension bridges: some 
new connections with nonlinear analysis, SIAM Review 32 (1990) 537-578. 


A. C. Lazer, and P. J. McKenna, Large scale oscillatory behaviour in loaded asymmetric systems, 
Analyse Nonlinéaire Annales de L’ Institut Henri Poincare 4 (1987) 243-274. 


P. J. McKenna, and W. Walter, Nonlinear oscillations in a suspension bridge, Arch. Rat. Mech. 
Anal. 98 (1987) 167-177. 


P. J. McKenna, and W. Walter, Travelling waves in a suspension bridge, SLAM J. Appl. Math. 50 
(1990) 703-15. 


I. Peterson, Rock and Roll Bridge, Science News 137 (1990) 344-346. 

H. Petroski, Still twisting, American Scientist, September—October, 1991. 

Mario Salvadori, personal communication. 

R. H. Scanlan, Developments in low-speed aerOSIeSHO in the civil engineering field, AIAA 
Journal 20 (1982) 839-844. 

R. H. Scanlan, Airfoil and bridge deck flutter derivatives, Proc. Amer. Soc. Civ. Eng. Eng. Mech. 
Division EM6 (1971) 1717-1737. 

R. H. Scanlan, The action of flexible bridges under wind IJ: buffeting theory., J. Sound and 
Vibrations 60 (1978) 201-211. 

R. H. Scanlan, and J. J. Tomko, Airfoil and bridge deck flutter derivatives, Proc. Amer. Soc. Civ. 
Eng. Eng. Mech. Division EM6 (1971) 1717-1737. 


1999] LARGE TORSIONAL OSCILLATIONS heh 


27. R.H. Scanlan, and J. W. Vellozi, Catastrophic and annoying responses of long-span bridges to 
wind action, Annals New York Acad. Sciences 352 (1980) 247-263. 

28. F. D. Schwarz, Why theories fall down, American Heritage of Inventions and Technology 8 (1993) 
6-7. 

29. F. D. Schwarz, Still Falling, American Heritage of Inventions and Technology 9 (1993) 7. 

30. Seattle Times /Seattle Post-Intelligencer, November 5, 1990 p. 12. 

31. J. L. Synge, and B. A. Griffith, Principles of Mechanics, McGraw-Hill, New York, 1959. 

32. A.M. Turing, The chemical basis of morphogenesis, Philos. Trans. Royal Soc. Ser. B 237 (1952) 
Ouse. 

33. P. Vielsack, and H. Wei, Sensivity of the harmonic oscillation of a suspension bridge model with 
asymmetric dissipation, Arch. Appl. Mech.—Ingenieur Archiv. 64 (1994) 408-416. 


P. J. MCKENNA did his undergraduate work in Dublin (at U.C.D.) and his graduate work in Ann 
Arbor. The central theme of his research is nonlinear analysis, in particular, the existence, multiplicity, 
and numerical approximation of solutions of nonlinear boundary value problems. The work arises 
naturally from his research in multiple periodic solutions of Hamiltonian systems. Kristen Moore, his 
doctoral student, is extending this analysis to the partial differential equations that describe the spatial 
behaviour along the length of the bridge. Movies showing computed solutions can be seen at 
http: \\www.math.uconn.edu\kmoore\ 

University of Connecticut U-9, Storrs, CT 06269 

mckenna@math.uconn.edu 


Cicero on mathematics.... 


For indeed you cannot fail to remember that the most learned men 
hold what the Greeks call ‘philosophy’ to be the creator and 
mother, as it were, of all the reputable arts, and yet in this field of 
philosophy it is difficult to count how many men there have been, 
eminent for their learning and for the variety and extent of their 
studies, men whose efforts were devoted, not to one separate 
branch of study, but who have mastered everything they could 
whether by scientific investigation or by methods of dialectic. Who 
does not know, as regards the so-called mathematicians, what very 
obscure subjects, and how abstruse, manifold, and exact an art they 
are engaged in? Yet in this pursuit so many men have displayed 
outstanding excellence, that hardly one seems to have worked in 
real earnest at this branch of knowledge without attaining the 
object of his desire. Who has devoted himself wholly to the cult of 
the Muses, or to this study of literature, which is professed by those 
who are known as men of letters, without bringing within the 
compass of his knowledge and observation the almost boundless 
range and subject-matter of those arts? 


De Oratore, I. iii. 9-10 
Contributed by Adi Ben-Israel, Rutgers University 
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Inverse Conjugacies and Reversing 
Symmetry Groups 


Geoffrey R. Goodson 


INTRODUCTION. We present some elementary group theory that arises in the 
theory of time-reversing dynamical systems. Let G be a (usually non-abelian) 
group and let a € G be a fixed element. The set 


C(a) = {x € G: xa = ax}, 


the centralizer of a, is a subgroup of G that contains (a), the cyclic subgroup 
generated by a. Our aim is to study the skew centralizer 


_B(a) ={xeG:xa=a™'x}. 


This paper arose from a course I gave on algebraic structures, where some of 
the results of Sections 1 and 2 and some examples from Section 4 were presented 
as exercises and then discussed in the classroom. In addition, the students were 
asked to calculate B(a) and C(a) for certain specific examples, sometimes with the 
aid of a software package. 

Generally B(a) is not a subgroup of G, and it may be empty. However, 
E(a) = B(a) U C(a) is a group, which is called the reversing symmetry group of a. 
In dynamical systems theory, the group element a represents the time evolution 
operator of a dynamical system. We present some results familiar to people 
working in time reversing dynamical systems, but our presentation is given in an 
abstract setting, entirely from an elementary group theoretic point of view, in the 
hope that it will be of interest to teachers of a first course in group theory. 

Section 1 gives some of the elementary properties of B(a). We see that B(a) is 
a group if and only if a is an involution, i.e., a* = e, the identity of G. In Section 2, 
we prove (following Lamb [8]) that E(a) is a group having C(a) as a normal 
subgroup. In dynamical systems, the case where a has infinite order is of most 
interest, but we show that there are interesting finite groups, such as the dihedral 
and dicyclic groups, that arise in a natural way from the study of E(a). In Section 3 
we study the inner automorphisms of E(a) and apply them when G is a topological 
group. We give particular emphasis to the situation when {s*:s € B(a)} is a 
singleton set. Inverse conjugacies involving the permutation groups and some 
infinite groups originating in dynamical systems theory are our focus in Section 4. 

In Section 5 we mention briefly the dynamical origins of the ideas discussed 
here, restricting our attention to the ergodic theory of measure-preserving trans- 
formations. 


1. ELEMENTARY PROPERTIES OF B(a). Elements a, b € G are said to lie in 
the same conjugacy class (that is, a and b are conjugate) if there exists some x € G 
satisfying a = x ‘bx. Thus a & G is conjugate to its inverse if B(a) # ©. 

The set B(a) is a group only in special circumstances. If it is a group, then B(a) 
contains the identity of G, which we denote by e, so ea = a~'e, a =a ' ora’ =e. 
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This in turn implies that Bla) = {x € G: xa =a 'x} = C(a). On the other hand, 
if B(a) = C(a), then B(a) is a group. Let us also mention that if B(a) N C(a) is 
non-empty, then we must have a” = e. We have proved: 


Proposition 1. For a group G and a given a € G with B(a) # ©, the following are 
equivalent: 
(i) B(a) is a subgroup of G. 
(ii) a is an involution, i.e., a” = e. 
(iii) BCa) = C(a). 
(iv) Bla) A Cla) # ©. 


If G is an abelian group, then C(a) =.G, so from Proposition 1, either 
B(a) = C(a) (when a? = e), or B(a) = @. If any of the conditions of Proposition 1 
hold, we say that we have the ¢rivial case. 

In general, B(a) may have elements of all even orders, and also of infinite 
order. However, if there exists an x € B(a) with x” = e for some odd n € Z, then 


a= A 


xa=a'x>x"a=a'x" >a=a'>a@ =e, 


SO we again have the trivial case. 
On the other hand, suppose x € B(a) is of order 2n and 2” is the highest 
power of 2 dividing an, ie., 2n = 2”k for some odd k € Z. Then y = x* € B(a) 


and y*” =x*?" =x?" = e, so B(a) also contains elements of order 2” (see [9, p 
14]). 
Notice that a € G is conjugate to a‘ if and only if there are u,v EG 


such that 


a=vu-! and u* =v’. 


To prove this, take any w such that aw = wa~'. Then (aw)* = awaw = wa ‘aw = 
w*. Set v = aw € B(a) and u=w. Then a =vu"' and v? = (aw)? = Ww? = u’. 


Conversely, suppose a = vu‘ and u* = v*. Then au = vu-'u =v and ua! = 


u(vu-!)~' = u*v7! = v*v ! =v, ie. au = ua. 


The case where there exists an involution in B(a) is important in the dynamical 
systems literature. We now see that a is conjugate to a~' via an involution if 
and only if there are u,v © G such that a = uv‘ and u* = v* =e (see [9, p. 13] 
and [4]). 


2. THE REVERSING SYMMETRY GROUP E(a) = B(a) U C(a). We claim that 
the set E(a) = B(a) U C(a), is a subgroup of G. This is clear if B(a) is empty, 
sO assume it is non-empty. Taking inverses of both sides of xa =a~'x gives 
a~'x~! =x7!a, so 
x €B(a)@x"'€B(a) and xa=a''x eax =xa"'. 

Since B(a) and C(a) are closed under the taking of inverses, so is E(a). 

Let x,y € E(a). If x, y € C(a), then axy = xay = xya. Therefore xy € C(a) and 
so xy € E(a). 

If x,y € Bla), then axy =xa'y =xya. Therefore xy € C(a) and again 
xy € E(a). 

The third possibility is x € B(a), y € C(a). In this case axy = xa” 'y = xya~', so 
xy € B(a), and xy € E(a). 

The proof that E(a) is a group is completed on noting that e€ Cla) C E(a). 
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The preceding argument shows that C(a) is a subgroup of E(a). Suppose a* # e 
and B(a) # ©, and let x, y € B(a). Then x" 'y € C(a) and y € x- C(a). It follows 
that B(a) Cx-C(a) and in a similar way that x-C(a) C B(a). In particular, the 
cosets of C(a) in E(a) are C(a) and x: C(a) = B(a). 

Clearly x-C(a)-x~' = C(a) for all x € E(a), so C(a) is a normal subgroup of 
E(a), and we see that E(a)/C(a) = Z,. If (a) = {a":n © Z} is the cyclic sub- 
group generated by a, then (a) is a normal subgroup of both C(a) and E(a). 


If G is a finite group having even order, then it is an easy exercise to show that 
there exists some a € G, a # e, with a? = e; the conditions of Proposition 1 are 
satisfied for a, so B(a) = C(a) # ©. Consequently, finite groups of even order 
always contain nontrivial elements that are conjugate to their inverse. 

On the other hand, finite groups of odd order never contain a nontrivial 
element that is conjugate to its inverse. Suppose the order of G is odd and there is 
an a € G,a # e, that is conjugate to its inverse. Lagrange’s theorem implies that a 
cannot be of even order, so a? # e. By assumption, B(a) # @, so Proposition 1 
implies that E(a) = B(a) U C(a) is a disjoint union. Since the cosets of C(a) in 
E(a) are C(a) and B(a), E(a) is a subgroup of G of even order. Lagrange’s 
Theorem again tells us that this is impossible. 


The fact that E(a) is a group with E(a)/C(a) = Z, appears in [8] and 
[9, p. 9-12]. However, the following may not be well known: 


Proposition 2. Let a € G with B(a) # ©. 


(i) If x € B(a) and x* € (a), then x* =e. In particular, if the order of <a) is 
infinite or odd, thenx? =e. — 
(ii) If C(a) = <a), then {x*: x € B(a)} is a singleton subset of C(a). 
(iii) If {x*:x € B(a)} is a singleton set, then the order of x divides 4 for all 
x © Bla). 
(iv) The center of E(a) is a subgroup of C(a). 


Proof: (i) If x € B(a) and x* € (a), then x* = a” for some n € Z. Since 


2 Z 


xa=a ix =xa"=a"x>x-x* =x’? x ax’ =e, 


we see that the order of x divides 4 and hence the order of x’ divides 2. If 
x* € (a), a cyclic group having infinite (or odd) order, then (a) cannot have any 
elements of order 2. Therefore x’ = e. 

Gi) In a similar way, if x,, x, € B(a), then x,x, € C(a) = (a), so x,x, =a” for 
some n € Z. Also x,a" =a-"X,, SO X,X,X> = (x,xX,)"'x,, Or xf =xz* =x5, sO 
{x?: x € B(a)} is a singleton set. 

(iii) If x € B(a), then x° € B(a) so x”? = (x°)? = x°, and this implies x* = e. 

(iv) If x is in the center of E(a), then xg = gx for all g € E(a). In particular, 
xa = ax,so x © C(a). 


Since (a) is a normal subgroup of C(a), the quotient group C(a)/<a) is well 
defined. This group is sometimes called the essential centralizer of a. Proposition 


2(ii) can be generalized to: 


If a € G has infinite order and C(a)/<a) has order m for some m € Z", then 
every x € B(a) has order 2k for some k that divides m. 


1999] INVERSE CONJUGACIES AND REVERSING SYMMETRY GROUPS 21 


3. THE INNER AUTOMORPHISMS OF E(a). The inner automorphisms of E(a) 
have the form ¢,(x) = sxs~' for some s € E(a). Since C(a) and (a) are normal 
subgroups of E(a), they are preserved by ¢, for all s € E(a). Furthermore, if 
s € B(a) and x € a), then $x) = x~'. This is because x = a” for some n € Z, 
and so #,(x) =sa"s"'=a-"ss'' =a" =x"!. 

When is it true that ¢(x) =x! for all x € C(a)? Let us say that s € B(a) 
conjugates C(a) to C(a)~' if sx =x~'s for all x € C(a), or, equivalently, if 
(x) =x7! for all x € C(a). 

we can now show that every s € B(a) conjugates C(a) to C(a)~' if and only if 
{s?:s € B(a)} is a singleton set. 

To see this we use the fact that B(a) = sC(a) for each s € B(a). If s € Ba) 
conjugates C(a) to C(a)~' then sx =x ~'s for all x € C(a). This implies that 
(sx)* = s7, and the result follows. 


Conversely, suppose that {s*:s € B(a)} is a singleton set. Then (sx)* = s* for 
any s © B(a) and x € C(a). This immediately gives sx =x~'s, so s conjugates 
C(a) to C(a)™". 

We leave it to the reader to show that if {s*: s € B(a)} is a singleton set, then 
C(a) is abelian. 


Our aim now is to apply the preceding results to the case where G is a 
metrizable topological group. Note that if (a) is dense in C(a), then C(a) must be 
abelian, and either C(a) = <a) or C(a) is uncountable. An immediate conse- 
quence of the next theorem is that if (a) is dense in C(a) and B(a) # ©, then 
every s € B(a) conjugates C(a) to C(a)~'. 


Theorem 1. Jf (a) is dense in C(a) and Bla) # ©, then {s*:s € B(a)} is a 
singleton set. 


Proof: We are given that {a":n € Z}= C(a). Let x,, x, € B(a), so x,x, € C(a). 
We use an argument similar to that in Proposition 2(ii). 
There is a subsequence {n,} of integers for which x,x, = lim,_,,,a”i. Further- 
more, ax, =x,a_‘ implies that a”x, =x,a~"' for all i. 
Letting i — ~ and using the continuity of multiplication and inversion in G, we 
obtain 
; 


XyXyX_ = 42X,(x,x,) 0° =x xz x7! > xx, =z ay! > x} = x7? 
1XgXy = XC Xj Xq XQ Xy aX, =X_ xy 2 =X". 


The fact that B(a) is closed under the taking of inverses now gives x7 = x4, so 
{x?: x € B(a)} is a singleton set. fe 


If x € B(a), then we cannot have a &€ {x*":n © Z} as this would imply a’ = e, 
the trivial case. Whenever G is a topological group, the set {x*”:n € Z} C C(a) is 
never dense in C(a) for any x € B(a). 

This section is based on [3] and [4]; see also [10], where inner automorphisms on 
E'(a), the reversing k-symmetry group are discussed. 


4. EXAMPLES. 
(i) Dihedral and Dicyclic Groups. The dihedral group D,, of order 2n arising from 
the symmetries of a regular plane n-gon is 


a, xa" =é,.x" =e, x “ax =a *): 
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The dicyclic groups of order 4m are 
(a,x:a°™ =e,x* =a", x lax=a''). 
See [1, p. 6-8] or [12, p. 65-66] for a discussion of these groups. 


If a has finite order n and C(a) = (a), Proposition 2(ii) says that x* = e if 
x € B(a). Essentially two different cases arise: 


(a) If x* =e, then E(a) = D,. 
(b) If x? #e, then x* =a””’. In particular, n is even and E(a) is a dicyclic 
group of order 2n. 


As a concrete example, consider the smallest of the dicyclic groups, the 
quarternion group QO, = {+1, + i, + j, + k} with multiplication given by i* = j* = 
k? = -1, ij =k, jk =i, ki=j, and ji= —k, kj = —i, ik = —j and the usual 
rules for multiplying by +1. Then C(@i) = {41, + i} and Bi) = {+j, + k}. Since 
C(i) = <i) is a cyclic group, {x7 : x € B(i)} = {—1} is a singleton set. 


(ii) Permutation Groups. Every element of the permutation group S, on n 
symbols is conjugate to its inverse. This is because any permutation in S$, can be 
resolved into a product of disjoint cycles in a unique manner except for the order 
in which the cycles appear. Two permutations are in the same conjugacy class if 
they have the same cycle pattern. For example, in S, the permutations 


x = (3,5,6)(2,4)(7,1) and y = (2,4,1)(3,5)(6,7) 


have the same cycle pattern, so they lie in the same conjugacy class. It is clear that 
every permutation and its inverse have the same cycle pattern. 

Not every element of the alternating group A, need be conjugate to its inverse. 
For example in A,, the cycles (1,2,3) and (1,2,3)~' are not conjugate. This is 
because B((1,2,3)) (in S,) consists solely of odd permutations. However, every 
element of A, is conjugate to its inverse ({12, p. 60]). Examples of infinite groups 
with this property are given in [7]. 

Note that the element o = (1, 2), 4) of S, has order 2, so C(a) = B(o). Also, 
Tt = (2,3, 4) € C(o) is of order 3 and ta = o'r. 


(iii) An Infinite Group. Let G = (a, k> be the finitely generated group subject to 
the relations kak~'! = a~! and k* = e. Then C(a) = (a, k*), and if x € B(a) then 
for some n € Z either x = ka” or x = k7!a". Now 


(ka")” = k?(k-'a"k)a" = k?a-"a" =k? #, 
and similarly for k~‘'a”. It follows that there are no involutions in B(a). However, 


|ka"| = 4 and {x*: x € B(a)} is a singleton set. It follows that B(a) conjugates C(a) 
to C(a)"'. 


(iv) Group Rotations. Let G be a compact monothetic topological group: there 
exists some a € G for which the set {a”:n © Z} is dense in G. In this case G is 
abelian, and we assume that a* # e. A nice example is the unit circle S' in the 
complex plane, with a € S' chosen so that it is not a root of unity. 

Let & be the group of all homeomorphisms h:G — G, a subgroup of the 
permutation group of G. If we define ¢,:G — G by ¢,(g) = a-g for some fixed 
a €&G, then ¢, is a rotation of G and @¢, € &. 

We claim that C(¢,) = {¢, :b € G}, for if w € C(¢,), then 


pod, = 6,°P=> w(ag) =ay(g), forall geG. 
If we) = b, then W(a) = a-b and (a") = a" -b for all n € Z. Continuity of w 
now implies that w(g) = b-g for all g € G, or W= @,. 
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Note that ¢7'(g) = a~‘g, so if we define S:G > G by S(g) = g™', then 


S°$,(g) = S(ag) = (ag)! =a7'g* = 6; '(g7') = 6; ' 2 S(g), 
or S € B(¢,). It follows that every member of B(d,) is of the form R = So @, for 
some b € G. Now 


R? =S°¢,°S°d, =S°¢,°¢6,'°S =S? = Id, 


where Id is the identity in ¥. We conclude that every member of B(@,) is an 
involution. 


(v) Automorphism Groups. Let X be a group and let ¥ = Aut(X) be the 
automorphism group of X. If ¥ is abelian (for example if X is cyclic, or is 
the p-adic integers) then B(¢d) # © if and only if ¢ € Y is an involution. In 
the general case there is a simple way to construct automorphisms conjugate 
to their inverses: Let @ € Aut(X). Then xX #7! € Aut(X XX). Define 
wy € Aut X x X) by w(x, y) = (y, x). Then & € B(d X g7') and we = Id. 

Other less~trivial members of B(d x @~') can be described. For example, 
define yw € Aut(X X X) by Wx, y) = (y,d(x)). We can verify directly that (¢ < 
db ')ow = po(h |! X s), and w has infinite order if @ has infinite order. 


Related examples can be given using countable direct products. Let X be a 
group and write G = II;__.,. X,, where X, = X for all i € Z. Define P:G — G by 
[D(g)], = %,+1, where the subscript n denotes the nth coordinate. Then ® is just 
the familiar shift automorphism. | 

Define automorphisms P and Q of G by [P(g)], =g_, and [Q(g)l, = 21_,- 
Then ® = PQ is a product of two involutions. It follows from the last paragraph of 
Section 1 that ® is conjugate to its inverse by an involution. 

In a similar way we can construct other automorphisms of the product space 
that are conjugate to their inverses. Let U: G > G be defined by 


UO iece regs Ps Pioee) a tasna@ “(Ba $( 8). 1(81), (82), }, 


where the « denotes the 0th coordinate and ¢ is an automorphism of X. 
Note that U* = ©’, where ® is the shift map, so if we let V=@® and 
A = UV™', that is, 


Als hs Boos Bin ace) fs (...56( 8-1), '( 80). $(81)5---) 


then the same reasoning as in the preceding example implies that A is conjugate 
to A~' and U € B(A). 


5. DYNAMICAL ORIGINS. Let 7:X — X be an invertible measure-preserving 
transformation on a Borel probability space (X,¥, w). This means that T7'B EF 
and u(T~'B) = w(B) for all B €.¥. If T, and T, are two such transformations on 
corresponding probability spaces, T, is conjugate to T, if there is an invertible 
measure-preserving transformation S$: X, — X, such that ST, = T,S (when we 
write f = g in this section, we mean f(x) = g(x) for all x €_X except possibly for 
a set of «-measure zero). With every measure—preserving transformation T we 
associate a unitary operator 


Up: LX, w) > L2(X, wu); Up f(x) = f(Be). 


A measure-preserving transformation 7 is said to have discrete spectrum if 
U, has discrete spectrum, 1e., there is a complete orthonormal sequence 
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{f,} C L°CX, w) and a sequence {A,} of complex numbers (of absolute value one) 
such that f,(7x) = A, f,(x) for all n = 1,2,.... More generally, T is said to have 
simple spectrum if there exists some h € L?(X, w) such that the closed linear span 
of {UZh:n € Z} is all of L’CX, w). 

A transformation T is ergodic if for any measurable function f, the condition 
f(Tx) = f(x) implies f = constant. This is equivalent to the condition: if A € F, 
then T-'A =A implies w(A) = 0 or 1. 

Suppose G is a compact topological group with Haar measure A, and whose 
measurable sets are the Borel subsets of G. If a © G, the rotation ¢,:G > G 
given by ,(g) = a-g is a Haar measure-preserving transformation. The transfor- 
mation @¢, is ergodic if and only if (a) is dense in G. A celebrated result of 
Halmos and von Neumann [6, Theorem 4] says that an ergodic transformation T 
with discrete spectrum is conjugate to an ergodic rotation ¢,:G — G for some 
compact topological group G and some a € G. 

The characters of G have the property 


x(¢.(8)) = x(4°8) = x(4) x(8), 


i.e., they are eigenfunctions of ¢,. Conversely, every eigenfunction of @¢, is a 
character of G. It is a consequence of Pontryagin duality theory that the characters 
constitute a complete orthonormal basis for L7(G, A). 

Let TJ be an ergodic transformation with discrete spectrum. The discrete 
spectrum theorem of Halmos and von Neumann implies that T is conjugate to its 
inverse. Also, if S is measure-preserving and ST = T'S, then S* = J. Further- 
more, the centralizer of T (with a suitable topology) is a compact abelian group 
that is isomorphic to G in a natural way. Conversely, any ergodic T for which 
C(T) is compact must have discrete spectrum. 

Halmos and von Neumann asked whether every ergodic transformation is 
conjugate to its inverse via an involution. In 1951, Anzai showed that there is an 
ergodic transformation that is not conjugate to its inverse. As a consequence, this 
type of inverse-conjugacy problem lay dormant for some years. Recently it was 
shown that if a transformation 7 has simple spectrum and is conjugate to its 
inverse, then every conjugation is an involution. This generalizes aspects of the 
Halmos—von Neumann theorem [3, Theorem 1]. In addition, in [3], [4], [8], and 
[9] one finds many examples of transformations whose centralizers and skew 
centralizers have group theoretic properties similar to those discussed in this 
paper. Ergodic transformations with a wide variety of centralizers are now known. 
For an ergodic 7, C(T) can be a compact abelian group, a countable cyclic group, 
an uncountable monothetic group (hence abelian, but not locally compact), as well 
as many other abelian and non-abelian groups. A nice treatment of certain 
examples arising in this theory is given in [2]. 


6. CONCLUDING REMARKS. In the dynamical systems literature there has been 
some confusion about the role of involutory time-reversal systems. Some authors 
seem to believe that the set of reversing symmetries B(a) (Gif non empty) must 
always contain an involution. Our examples show that this is not generally the case, 
although it is true in certain special contexts. For example, if G = GL(n, R) then 
for all a € G such that B(a) # ©, B(a) contains an involution; see [13, Chapter 
2]. Related results are given in [5], where properties of real orthogonal matrices 
are studied in this context. A survey of time-reversal symmetry in dynamical 
systems is given in [11]. 
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More on Paperfolding 


Dmitry Fuchs and Serge Tabachnikov 


It is a common knowledge that folding a sheet of paper yields a straight line. We 
start our discussion of paperfolding with a mathematical explanation of this 
phenomenon. The model for a paper sheet is a piece of the plane; folding is an 
isometry of the part of the plane on one side of the fold to another, the fold being 
the curve of fixed points of this isometry (see Figure 1). The statement is that this 
curve is straight, that is, has zero curvature. 


z-—\! 


Figure 1 


If not, consider an arc y of the fold with nonvanishing curvature. Let y, be the 
curve at (small) distance ¢« from y on the concave side, and let y_ be the 
corresponding curve on the convex side, as illustrated in Figure 2. Then 

length y, > length y > length y_, 


where the difference between lengthy, and length y_ is of order e- lengthy: 
curvature y. On the other hand, the isometry takes y, to y_, so lengthy,= 
length y_. This is a contradiction. 


Figure 2 
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In spite of what has just been said, one can fold paper along an arbitrary smooth 
curve! The reader is invited to perform an experiment: draw a curve on a sheet of 
paper and slightly fold the paper along the curve. A word of practical advice: press 
hard when drawing the curve. It also helps to cut a neighborhood of the curve, for 
it is inconvenient to work with too large a sheet. A more serious reason for 
restricting to a neighborhood is that this way one avoids self-intersections of the 
sheets, unavoidable otherwise. The result looks somewhat like Figure 3(a): 


SNA 


(a) 


Figure 3 


One may even start with a closed curve drawn on paper. To be able to fold, one 
has to cut a hole inside the curve; the result is shown in Figure 4. 


Figure 4 


It goes without saying that the argument in the opening paragraphs of this 
article does not contradict the possibility of folding along a curve: the two sheets in 
Figure 3(a) meet at a nonzero angle. To fix terminology call the curve drawn on 
paper the fold and denote it by I; call the curve in space obtained by folding along 
I’ the ridge and denote it by y. The above described experiments and dozens of 
similar ones that kept us busy lately reveal the following list of observations: 


(1) It is possible to start with an arbitrary smooth fold and obtain an arbitrary ridge 
provided the ridge is ““more curved” than the fold. 
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(2) If the ridge is only slightly more curved than the fold, then the neighborhood of 
the fold to be folded should be taken very thin, at least from the side of the 
convex domain of the plane bounded by the fold. 

(3) If the fold has an inflection point (i.e., point of zero curvature) then the 
corresponding point of the ridge is also an inflection point (notice that, unlike 
plane curves, a generic space curve does not have inflection points at all). 

(4) If the fold is a closed strictly convex curve then the ridge has a nonzero torsion, 
that is, does not lie in one plane. 

(5) If the fold is a nonclosed arc, the folded paper tends to occupy such a position 
that the ridge lies in a plane, and the angle made by the two sheets is constant 
along the ridge. 


What follows is an attempt to explain these experimental observations. A 
surface obtained by bending, without folding, a paper sheet is a developable 
surface, that is, a surface locally isometric to the plane (one cannot stretch paper!). 
The theory of such surfaces is due to Euler; its main results are as follows. A 
developable surface is a ruled surface, i.e., it consists of a one-parameter family of 
straight lines called rulings. These lines are not arbitrary: they are tangent to a 
certain space curve called the edge of regression (this description does not include 
two special cases, cylinders and cones, which are also developable surfaces). The 
tangent planes to a developable surface along every ruling coincide: one can put 
not only a knitting needle on such a surface but also a ruler. Thus a developable 
surface is the envelope of a one-parameter family of planes (see Figure 5). 


Figure 5 


Consider Figure 3(b). One sees two developable surfaces intersecting along a 
space curve y. Unfolding either of the surfaces to the plane transforms y to the 
same plane curve I’. Reverse the situation and pose the following question: given a 
plane curve I’, a space curve y and an isometry f: 1° — y, is it possible to extend f 
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to an isometric embedding of a plane neighborhood of I’ to space? Said differ- 
ently, can one bend a sheet of paper with a curve [’ drawn on it so that [' bends to 
a given Space curve y? 


Theorem. Assume that for every x € I the absolute value of the curvature of y at 
point f(x) is greater than that of T at x. Then there exist exactly two extensions of f to 
isometric embeddings of a plane neighborhood of V to space. 


These two embedded surfaces are the sheets intersecting along the ridge in 
Fig. 3(b). Extending the sheets beyond the ridge one obtains another configuration 
of sheets that meet along y. Thus there are exactly two ways to fold paper along I 
to produce the curve y. This explains and extends the first of the previously 
mentioned observations. 

If y lies in a plane, one of the sheets is obtained from another by reflection in 
this plane. In the general case of a nonplanar curve y the tangent planes of the 
two sheets are symmetric with respect to the osculating plane of y at its every 
point. , 


Proof: Parametrize the curves y and I by the arclength parameter ¢ so that 
y(t) = f(T()). Let the desired developable surface $ make the angle a(t) with the 
osculating plane of the curve y(t) (well defined since, by assumption, the curvature 
of y never vanishes). Denote by k(t) the curvature of the space curve y and by 
K(t) that of the plane curve I’. The geodesic curvature vector of y in S is the 
projection of the curvature vector of y in space onto S; thus the geodesic 
curvature of y equals k(t)cosa(t). Since an isometry preserves the geodesic 
curvature of curves, kcosa = K. This equation uniquely determines the nonvanish- 
ing function a(t) up to the substitution a — w— a. To construct the developable 
surface S from the function a(t), consider the plane through point y(t) that 
makes the angle a(t) with the osculating plane of y. Such planes constitute a 
one-parameter family, and according to the above described general theory, their 
envelope is a developable surface. = 


Remarks. 1. The theorem is hardly new: it is mentioned as an exercise in [2] with a 
reference to [1]. For a later discussion of paperfolding see [3]. 

2. A direct computation involving the Frenet formulas for y (which we omit) 
makes it possible to find the angle B(t) made by the rulings /(t) with the curve y(¢) 
in terms of the torsion x(t) of y: 


a'(t) — k(t) 


ee k(t)sina(t) ~ () 


For the two developable surfaces corresponding to the angles a(t) and 7 — a(t) 
one has: 
2«(t) 


cot B,(t) a cot B,(t) = ~ k(a)sina(t) 


Therefore the ridge y is a plane curve (i.e., « = 0) if and only if B, + B, = 7. In 
this case, unfolding the two sheets on the plane yields the straight rulings that 
extend each other on both sides of the fold [’; see Figure 6. 
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By, 


Figure 6 


The reader with a taste for further experimentation may find the following one 
of interest. Start with a fold [ and tape a number of pins on both its sides. In this 
way one prescribes the angles B,(t) and B,(t). Then fold along I, as illustrated in 
Figure 7. . 

Remark 2 may be used to explain the second of our experimental observa- 
tions. Namely, since cosa(t) = K(t)/k(t), the formula (1) for cot B(t) may be 
rewritten as 

a'(t) — x(t) 


cot B(t) = Vet)? Ky? 


Assume that K(t) is close to k(t); then a(t) is close to zero. We need, however, to 
assume that a'(t) is also close to zero. If, moreover, x(t) is bounded away from 
zero, then a(t) is small, and cot B(t) is large; hence the angle B(t) is small. It is 
clear that straight lines crossing the boundary of a convex domain in the plane at 
small angles cannot penetrate deep in the domain; hence, they have to cross each 
other near the boundary (see Figure 8). 


Figure 7 


But the rulings of a non-self-intersecting developable surface do not cross each 
other. Hence, to avoid crossings we need to make the neighborhood of the fold 
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Figure 8 


thin. The limit case of this observation is particularly interesting. Suppose that 
K = k. Then the isometry between the fold and the ridge cannot be extended into 
the convex domain bounded by the fold at all: It can be extended into the concave 
domain, and we get a developable surface, for which the edge is the edge of 
regression. Indeed, formula (1) for cot B(t) gives cot B(t) = ~; hence B(t) = 0, and 
the rulings of the surface are all tangent to the ridge. Of course, in this way we get 
only one of the two pieces of the surface, cut along the edge of regression. The 
other piece may be made of another copy of the same concave domain. The 
difference between the two pieces is that for each tangent to the boundary of our 
concave domain, divided into two halves by the point of tangency, one half is 
straight on one of the pieces and the other half is straight on the other piece. The 
image of the whole tangent on each piece is a curve, half of which is straight and 
half of which is curved, as illustrated in Figure 9. 


Figure 9 


The union of the two pieces with the images of the two tangents looks like 
Figure 10. 


Figure 10 
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As a by-product of these observations we learn how to make a paper model of a 
developable surface with a prescribed edge of regression (without inflection 
points). To do this we should draw a planar curve whose curvature is precisely the 
same as that of the intended edge of regression; two copies of this drawing should 
be made on two separate sheets of paper. Then we cut the sheets along the curves 
and take the concave portions of both. After this we bend the two pieces to make 
their edges fit into the given spatial curve. This may be done in two different ways, 
and we must bend our (identical) pieces into different surfaces; these two surfaces 
comprise the developable surface we are constructing. Since the “angle” between 
the two pieces should be 0, it may be useful to glue the two pieces before bending 
along a very thin neighborhood of the edges. But be aware, that this bending is not 
even a twice differentiable mapping (this is why we used quotation marks for the 
word “angle’’), and the paper will be resistant to this construction. Pins, attached 
to the two pieces tangentially to the edges (as shown on Figure 11) may help. 


Figure 11 


Back to our list of experimental observations. The first two have been explained, 
proceed to the third one. Let ['(t,) be a nondegenerate inflection point, so the fold 
looks like a cubic parabola near this point (see Figure 12). 


Figure 12 


Then K(t,) = 0 and, according to the already familiar formula kcosa = K, either 
a(t)) = 7/2 or k(t) = 0. We want to show that the latter possibility holds. 
Suppose not; then both sheets are perpendicular to the osculating plane of y at 
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point y(t,) and, therefore, coincide. Moreover, if k(t,)) # 0 then the projection of 
the curvature vector of the space curve y onto each sheet is the vector of the 
geodesic curvature therein. This vector lies on one side of y on the surface at 
points of the form y(t) — €) just before the inflection point and on the other side 
at points y(t, +e) just after it. Therefore the function a(t) — 7/2 changes sign at 

= t). This means that the two sheets pass through each other at ¢ = t,. Possible 
in the class of immersions, this cannot happen with real paper. Thus k(t,) = 0, 
that is, the ridge has an inflection point. 

Next, consider the fourth observation of our list. Assume that both y and I are 
closed plane curves and I is strictly convex. The relation between the curvatures 
still holds: kcosa = K, and K(t) does not vanish. Hence k(t) > K(t) for all ¢ and 
[k(t) > [K(t) unless a(t) identically vanishes. On the other hand, the integral 
curvature of a simple closed plane curve equals 277, so the above integrals must be 
equal. This is a contradiction. It is interesting that if T’ is closed nonconvex curve, 
one can nontrivially bend paper along I’ keeping I in the plane; see Figure 13. 


Figure 13 


Finally we turn to the fifth experimantal observation. This one takes us into 
dangerous waters because its explanation requires further assumptions concerning 
elasticity properties of paper. A strip of paper resists twisting: being relaxed it 
tends to become flat. Consider a space curve y(t) parametrized by the arclength. 
Let S be a thin strip along y and v(t) the unit normal vector field to y in S. 
Define the twist of S to be the length of the projection of the vector v’(t) to the 
normal plane of y(t). Our assumption is that a paper strip tends to minimize its 
twist. Let the strip make the angle a(t) with the osculating plane of the curve y(¢). 
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Then a computation, similar to the one mentioned in Remark 2, gives the value 
|x(t) — a'(t)| for the twist. Folding paper, one produces two strips along the ridge 
y(t), the angles being a(t) and a — a(t). The twists of these strips are equal to 
|x — a’| and |x + a’ |. Both quantities attain minimum if x(t) =0 and a(t) is 
constant. This appears to explain the fifth experimental observation. 
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Cicero on “pure” vs. “applied” science... 


Moreover, Catulus, if you ask me my personal opinion as to the 
study in question, I do not think that a person of ability, and 
acquainted at first hand with public life and procedure in parlia- 
ment and the law-courts, requires as much time as has been taken 
for themselves by those who have spent the whole term of their life 
in study. For all branches of knowledge are handled by those who 


apply them to practice in a different manner from that in which 
they are handled by those who take their pleasure in the pursuit of 
the sciences themselves and have no intention of following any 
other career. 


De Oratore, UI. xxiii. 86 
Contributed by Adi Ben-Israel, Rutgers University 
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Trigonometric Integrals 
and Hadamard Products 


L. R. Bragg 


1. INTRODUCTION. Let f(z) = 7 _,)a,z” and g(z) = X°_,b,z” be analytic 
functions in respective disks D, and D, centered at the origin. Let h(é) = 
Lu _9a,b,&€”" be analytic in some disk centered at € = 0. In a famous 1899 paper 
[5], J. Hadamard used the integral convolution 


h(é) = (2m)? f’"F(ze'*)g(ger"*) dO (1.1) 


(€ = zé) to characterize the singularities of the function h(€) in terms of those of 
f(z) and g(z) (see also [13, p. 157]). The coefficient-wise product of power series 
denoted by h(é) = f(z)° g(Z) is often called the Hadamard product of f(z) and 
g(Z) (alternative appellations include Schur product and quasi inner product). 

The Hadamard product appears naturally in a variety of theoretical and applied 
mathematical questions. While studying the Bieberbach conjecture long before L. 
deBrange’s proof, C. Loewner and E. Netanyahu [7] showed that a Hadamard-type 
product of two normalized (f(0) = 0, f’(0) = 1) univalent (one-to-one) analytic 
functions in the unit disk need not be univalent and can even have a zero 
derivative. St. Ruschewegh and T. Sheil-Small [12] showed that the Hadamard 
product of two normalized analytic convex mappings of the unit disk is a convex 
mapping. In [3], the Hadamard product appears in a variety of multiplier problems. 
In an 1894 paper [9], Th. Moutard used (essentially) the fact that an entry-wise 
product of positive definite matrices is positive definite to establish uniqueness 
theorems for solutions of a class of elliptic partial differential equations. Many 
properties and applications of the Hadamard product are surveyed in [6]. The 
author has used this product and a generalization to derive known and new 
formulas for special functions [1]. Finally, the Hadamard product, together with a 
factor switching property, was used in [2] to construct solution formulas for a 
variety of Cauchy-type problems. 

Because of its connection with the integral convolution (1.1), the Hadamard 
product can lead to elegant evaluations of complicated trigonometric integrals and 
provide analytic derivations of combinatorial identities. The student with a solid 
background in the calculus and basic differential equations can use it to carry out 
beginning studies on special functions and their representations while being 
introduced to notions of complex variables. We focus on these areas while 
emphasizing how to choose the functions in the associated Hadamard products. 

For example, consider the series definition for the Bessel function 


00 oO 


In(z) = LX (- 1)" 2*/{22"nl- al} = XS (1/(2"a!))((- 1)" /(2"a!)) 22" 
n=0 n=0 

and observe that the terms 1/(2"n!) are the coefficients in the series for e7/* 

while the terms (— 1)” /(2”n!) are the coefficients in the series for e~*/7. It follows 
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that 
Jo(z) =e? ce77/? = (2m) ee dé. 
Applying the Euler formulas to the integrand yields the standard integral formula 
J)(z) = (20) ["cos( zsin 9) dé. 


Before outlining the topics to be considered in the following sections, we 
introduce a Hadamard-type product depending on two integer parameters. 


Definition 1. For p and gq relatively prime, and for z © D,, ¢ € D,, define 
= 20 , ai 
f(z)p°g8(£) = (20) J f(ze?®) g( £e~4°) dO = Y ayyb,, 20%". (1.2) 
n=0 


If either f(z) or g(¢) is a polynomial, then the series in (1.1) is a polynomial in zé. 

In Section 2, we apply (1.1) in various ways to polynomials of the form (1 + z)" to 
obtain evaluations of numerous trigonometric integrals and prove binomial identities. 
Repeated Hadamard products are employed in Section 3 to evaluate multiple trigono- 
metric integrals. Section 4 treats integral representations of some special functions in 
which at least one of the functions entering the associated Hadamard product is not a 
polynomial. Finally, in Section 5, we introduce the selector function, which sums the 
coefficients of a subseries of a given series, and show how it can be used to replace 
certain sums by integrals. 


2. SPECIAL SUMS AND SINGLE INTEGRALS. Let us now use (1.1) to evaluate 
some special sums and trigonometric integrals. 


Example 2.1. We wish to establish the familiar binomial identity 
2n 2 
{| 2n n[ 2 
¥ (-0/(7"} = (-9"(2") 2.1) 
j=0 ¢ : 


(see [11] for this and related identities). On the one hand, through integration by parts, 
one obtains a reduction relation which permits showing 


Tn . 2n ae on an 
: sin“"@ dé aa : | (2.2) 


Alternatively, we note that 2isin@ = e'® — e~'® = (1 + e'’)(1 — e7!), which suggests 
considering the Hadamard product (1 + z)’"°(1 —z)*". With a, = | ae b, = 
(~—1)/a;, and p = q = 1, (1.1) gives 


1 


(1+z)"°(1—z)" = ror BS + ze?) "(4 ~ zei®)""do 
= )) (-1)’ f | 7, (2.3) 
j=0 
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Setting z = 1, we have 


7 n 27 zit j ? 
i sin?"9 do = (—1)" 1/7") (2.4) 
0 yen j=0 J 


which, together with (2.2) gives (2.1). 


Example 2.2. Suppose that p and q are relatively prime and let r be a positive integer. 
We prove 


T T é qr pr 
cos?’(g@)-cos?’( p@) dé = 2 
J cos?"(q8) - cosi( p8) soe (2 (7h (2.5) 


Replace cos@ by (e'® + e7'®)/2 and take f(z) = (1 +z)” and g(z) = (1 +z)?" in 
(1.2), which becomes 


f(2)p° g8(2) 


(2m) * (1 + zeP!*) "(1 + 20-4)” 
0 


s(cl(aler= 20 


[=0 


. 


Compute 
(1 +e?!) 7 (1 + ait” 


_ | eparie/2( ePi/2 % a Piey™ || erates (eat? fi e402)" 


g\7 g\?" 6 6 
= [20085] [200s - 210 e0s1"| F Jeoser( ©), 


insert this into the integral in (2.6) (with z = 1), make the change of variables 0 = 2¢, 
and (2.5) follows. 

A generalization of (2.5) can be obtained by taking f(z) = (1 +z)” and g(z) = (1 
+z)". By applying (1.1), taking the real part of the integral involved, and making a 
simple change of variables, one can establish 


ae m n T a 
if (cospp) (cosqh) cos(mp — nq) ddd = Taz, py | (2.7) 
where H = min{ 


Ar [=]}. A variety of analogous integrals appear in |4, pp. 
372-374]. 


Example 2.3. Let z, = z, =z and take f(z) = g(z) = (a + bz + az’)". The trino- 
mial expansion gives f(z) = X3".,c,z*, in which 


| y; , jet 
ee Ji>J2>J3 


where j, + J, + j, =n andj, = 0. We show that 


C, = 


2n 
> cg = 07", F,(—n, —n + 1/ 231; 4a*/b7). (2.8) 
k=0 


where ,F (a, B; y; z) denotes the usual Gauss hypergeometric function with parame- 


38 TRIGONOMETRIC INTEGRALS AND HADAMARD PRODUCTS [January 


ters a,B, and y [8, pp. 37-62]. Compute 
2n 
ye ce =(f(z) °f(2Z)) c= 
k=0 
= (27) "' [a + be!® + ae?!*)"(a + be~!® + ae~?!*)" do 
0 


= (20) [°"(2acose +b)P"do. (2.9) 


Expand the integrand in the latter integral, integrate term-by-term, and use the beta 
function and its reduction properties (see [8, pp. 7-8]) to find that 


pee Ariprn=2i 
(Qn — 2k) IKIKY 


(2.10) 


_ Finally, replace (2n)! /(2n — 2k)! in this by 27*(—n),(—n + 1/2), to obtain (2.8). 


3. SOME MULTIPLE TRIGONOMETRIC INTEGRALS. The Hadamard product 
of two polynomials is a new polynomial, which one can use to form a Hadamard 
product with another polynomial, etc. By repeating the Hadamard product operation 
and finally evaluating at z = 1, as in Section 2, one can evaluate various multiple 
trigonometric integrals. 


Example 3.1. As in (2.1.6), we have 
(1+z)"0(1 +z)" = (2m) ' [°"(1 + ze*)"(1 + zei*)"d0 
0 


y (=) 23 = F(z). (3.1) 


j=0} 


An application of (1.1) to F(z)°( + z)” yields 


LPC + ze (4 + 20 ON(1 + 20)" a0 


| 


“2 fe) ae 62 


Set z = 1 and observe that the bracketed term in the integrand can be written as 
23e'?/*cos((h + 0)/2) - cos((d — 8)/2)cos(h/2) = 27e'?/*cos(h/2)[cos & + 


cos 0]. Insert this into (3.2), with z = 1 , and take the real part of both sides to get 


» || 


[°" [“cos(nb/2)cos"(4/2)[cos + cos 0)" 1p d0= -E (*) (2). 


j=0 


os 
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A similar treatment of G(z) = F(z)e F(z) leads to the triple integral formula 


oe ; Di 3% 4 
l, l, f, [(cos 6, + cos @,)(cos 6, + cos 6;)] dO, dé, dé; = oF ~ Ci] | 


k=0 
(3.4) 


Example 3.2. The repeated Hadamard product [1 + z)” °0(1 + z)"]o(1 + 27)? yields 
the double integral formula 


f° [ 'cos[(m —n)0+(m+n-—2p)¢|cos”( 6+ d)cos"(6—  )cos?(2p)dd dd 


te Fe ¥ ele (OS) 
Jmtntp j=0 J 7 j : : 


Taking m = n = p, it is not difficult to establish 


as Ma Lense licon ehscoe-@) "aode = Fe ("7 ) 


4. SOME SPECIAL POLYNOMIALS. Thus far, both Hadamard product factors 
f(z) and g(€) have been polynomials. A convenient non-polynomial choice for f(z) is 
the exponential function and we shall use it to obtain trigonometric integral representa- 
tions for the classical Laguerre polynomials and for some special hypergeometric type 
polynomials. 


Example 4.1. The Laguerre Polynomials. The Laguerre polynomials are defined by 
L,(z) = Lio eae (7) (see [8, p. 239] or [10, pp. 200—217]). We identify i, as 


j | 
the Maclaurin coefficients of (1 + £)" and (—1)!/ j! as the Maclaurin coefficients of 
e-*, Then using (1.1), we have 


rset 8 (PEM (nel enscneay 


J 


| 27 —zel? -ie\" 
oe e7"(1 + Le!) 8. (4.1) 
If we now select z = x (real) and ¢ = 1, we obtain 
L(x) = Re{ (2) eee 0 ie ET eee a ao 40} 
0 
= 2"(27) | oe e~*°S"cos( xsin 6 + n0/2)cos”"( 6/2) dé 
0 
Sa f e*°S? Peos( xsin2 d + nd)cos"’> dd. (4.2) 
0 

Since L(O) = 1, this yields [{cos(n¢)cos"6 db = 17/2". 
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Example 4.2. 4 Hypergeometric Polynomial. Select f(z) =e % and g(£) = 
(1 — £)". Then (1.1) gives 


e7,0,(1~£)" = (27) freer" — £e~'9)" do 


leant 
- » = i) ~ Pals $), oo 


Choose z = x, x real, and £ = 1 and obtain 


B ae n! 
PD Bee Gein = aT 


Taking the real and imaginary parts of the integral in (4.3), and making the change of 
variables 0 = 2¢ gives 


(-1)"72" 
os 


(24) OP ae 
| e *°S “bsin(xsindd + nd)sin"hbdd for n odd. 
0 


[re ~*008 4% cos( xsin 4d + nd)sin"ddd for n even, 
0 
PAX, 1) _ 


17 


(4.4) 


5, THE SELECTOR FUNCTION. Let S,(z) = Lf_)z/ = (1 — z**")/ — z) and 
let P,(z) = U%_a,z/. Form P,(z), ° S,(z) and replace z by 1 to obtain 


7 on . (1 = eae nie) 
27) ' | P,(e!?) ———_—.,—. 40 = 
(2m) Pe) aa) 


7 gin g(k + 1)0/2 mi(i 
(2m)! f° Pe? lear i Ae em a,;. (5.1) 
0 7 sin g0/2 j=0 a 


The latter series is a sum of a subset of the coefficients of the polynomial P,(z) in which 
the indices are multiples of q. This particular subset also depends upon the choice of k. 
It is clear that if q = 1 and k = n, then the series in (5.1) reduces to P(1). Thus, with 
appropriate choices of gq and k, (5.1) can be applied in a variety of ways to replace finite 
sums by trigonometric integrals. Because of its utility for singling out particular terms of 
a polynomial, we refer to S,(z) as a selector function. 


To write an integral formula for the binomial sum ee) take P(z) = 
(1 +z)", q=1, and k =m in (5.1). Applying the Euler relations and taking 
§ = 2¢ leads to 
sin((m + 1)¢) 

sin(¢) 


While the denominator in the integrand vanishes at ¢ = 0 and ¢= 7, so also 
does sin((m + 1)@). The singularities cancel and the integral (5.2) is well defined. 


y " = — [“c0s"(d)eos((n —~m)¢) db. (5.2) 


j=0 
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As a final example, take P,(z) = (1 + z)?? and k =p in (S.1) with q>1. We 
leave it to the reader to conclude that 


7 sin ae | 27 P 
i) song Pe dg=—— aan (5.3) 
0 sin gd 2P8 NIG 
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Cluster Primes 


Richard Blecksmith, Paul Erdos, and J. L. Selfridge 


1. INTRODUCTION. A prime p> 2 is called a cluster prime if every even 
positive integer less than p — 2 can be written as a difference of two primes 
q — q', where g and q’ are both less than or equal to p. Due to the concentration 
of primes at the beginning of the positive numbers, the first 23 odd primes 3, 5, 7, 
11,...,89 are all cluster primes. The smallest non-cluster prime is 97: the previous 
prime is 89 and so 88 = 97 — 9 is not a difference of two primes smaller than 98. 
In general, if p is a cluster prime, then there must be enough primes in a “small” 
neighborhood to the left of p so that the even numbers p — 9, p — 15, p — 21, 
p — 25, etc., can all be written as the difference of primes less than p. 

The notion of cluster primes is reminiscent of the notion of prime constella- 
tions. The most famous prime constellation is that of the twin primes {p — 2, p}. 
More elaborate constellations such as {p — 8, p — 6, p — 2, p} have also been 
studied [4, pp. 64-68]. In a prime constellation both the number of primes and the 
differences between them are fixed. There is no guarantee that the largest prime in 
a prime constellation is a cluster prime. A sparse set of primes may lie just in front 
of this constellation. The first pair of twin primes {p — 2, p} for which p is not a 
cluster prime is {227, 229}, because the number p — 27 = 202 is not a difference of 
primes less than 230. To see this, observe that 202 = 211 — 9 = 223 — 21 = 227 
— 25, and 211, 223, 227 (the twin of 229) are the only primes between 202 and 228. 
We show in the proof of Theorem 1 that the number of primes in a small interval 
just before a cluster prime p grows in size with p. In this way, a cluster prime can 
be thought of as the largest prime in a “galaxy” of primes. 

It is reasonable to expect that among the primes, the cluster primes become 
increasingly rare. In spite of the initial head start of 23 consecutive cluster primes, 
the non-cluster primes quickly catch up, so that by the prime 2251 we have 167 
cluster primes and 167 non-cluster primes. Starting with 2267, the next prime after 
2251, the cluster primes begin to lag further and further behind. When we reach 
10°, the non-cluster primes outnumber the cluster primes by a ratio of about 325 
to 1. 

The simplest question we can ask about the distribution of cluster primes Is: 


Are there infinitely many cluster primes? 


An affirmative answer would imply that p,.,, — p, < 6 for infinitely many primes 
P,,, Which is a well-known hopeless problem. We enjoy more success looking for an 
upper bound for 7,(x), the number of cluster primes not exceeding x. In Section 2 
we show that eventually 7,(x) is less than x/(og x)°, for any fixed positive integer 
s. Our result “suggests” that the cluster primes are less numerous than the twin 
primes, although we have no way of proving that either of these two collections 
is infinite. Our theorem is powerful enough, however, to show that the sum of 
the reciprocals of the cluster primes converges, a result well-known for the 
twin primes. 


1999] CLUSTER PRIMES 43 


In Section 3 we present an efficient algorithm, which, given a particular cluster 
prime p,, determines the next cluster prime greater than p,. We used this 
algorithm to compute the cluster primes up to 10” and we give the values of 77,(x) 
for powers x = 10“, where k ranges from 2 to 13. These data and a discussion of 
the results are presented in Section 4. We conclude with a comparison of the 
number of cluster primes versus the number of pairs of twin primes. 


2. AN UPPER BOUND FOR 7,,(x). We have the following result: 


Theorem 1. For every positive integer s, there is a bound x) =x,(s) such that if 
X > Xp then 


m7,(x) < 


c 


(logx)” ) 


The proof is based on the following two lemmas: 


Lemma 1. Let a(x) denote the number of primes <x. Then for x = 6, 
2X-= 6 
logx 


a(x) < 


Proof: We use the estimate of Rosser and Schoenfeld [5]: 
1.256x 


log x 


a(x) < =. Der als (2) 


Since 
2x—6 1.256x + (0.7x —6)  1.256x 
2 


log x log x log x 


for all x > 9, Lemma 1 follows for x > 9. One can easily verify that the formula in 
the lemma holds for 6 < x < 9. x 


Our main tool in proving Theorem 1 is the following application of Brun’s sieve. 
The notation f(x) < g(x) means that for some constant M and value Xp, 
| f(x)| < Mg(x) for all x > xp. 


Lemma 2. Let s be a natural number, let d,,...,d, be s distinct, nonzero integers, 
and let f(x) count the number of primes p in the interval 0 < p <x such that the 
differences p — d, are prime for each i = 1,..., 8. Then 

s+]? 


1 p(p)—s 1 
fay< TT {i-— 5 OR ed Berseges 
PIT ci<j¢5(4;-4)) P p\d,-d, P (log x) 


where p(p) denotes the number of modulo p distinct numbers among the d,’s, and 
where the constant implied by the < -notation depends only on s. 


Xx 


For a proof of this lemma, take y = x in Corollary 2.4.2 in [2, p. 81]. 


Proof of Theorem 1: Suppose p is a cluster prime. We wish to get a lower bound 
on the number of primes in the interval [p — t, p), where ¢ is a small positive 
integer, to be specified later. By the definition of cluster prime, every even number 
2r in the interval p — t < 2r < p — 3 must be of the form g — q’, where g and q’ 
are primes <p. Clearly q' must be <t. By Lemma 1, if t => 6 the number of 
these primes q’ is less than 2(t — 3)/log t. On the other hand, there are more than 
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(t — 3)/2 even numbers in the interval [ p — t, p — 3]. Thus there must be at least 
+ log t primes in [p — t, p). Define 


aa . 


ways to place s numbers gq, >q,> °°: >4q, in the interval 


There are : 


[p —t, p). (We allow the g,; to be even to simplify the calculations.) Using the . 


crude estimate {‘] < nt‘, there are fewer than ¢t* choices for the s differences 


d,;=p-—q, 1<is<s. If the differences d, < --- <d, are fixed, Lemma 2 en- 
sures that the number of choices of p <x such that each p — d, is prime is at 
most Mx /(log x)°*', where M is a constant depending only on s. Thus 


x 
a(x) <M————t'. 
( ) (log xo 
Now given s, let t be the least positive integer satisfying equation (3). Taking x so 
large that t° < log x, we have 


x 

Te 4) SM Se 

(log x) 

Theorem 1 follows easily. icy 
A consequence of Theorem 1 is the following result. 


Theorem 2. The sum of the reciprocals of the cluster primes is finite. 


Proof: If the set of cluster primes is finite, there is nothing to prove, so assume 
there are infinitely many, and denote the n-th cluster prime by q,. Consider 
Theorem 1 with s = 2. For n sufficiently large, 


c n 2° 
(log q,,) 
But 7,(q,) =n and (log q,)* > (og n)*. Thus, for n sufficiently large, we have 
g, > n(log n)*. Since the series Xn~ ‘(log n)~* converges, by the integral test, it 
follows that 21/q, converges by the comparison test. | 


It appears that a stronger result than Theorem 1 may actually be true: 


Conjecture. For some constant a, we have 


x 
Tex) ego “) 


This result would follow from Lemma 2 if we could guarantee that the implied 
constant in the lemma does not grow too fast as a function of s. 


3. THE ALGORITHM. Let p, be the mth prime. We describe an algorithm that 
inputs the index v of the current cluster prime p, and returns the index of the 
next cluster prime. The idea is simple. Since p, is a cluster prime, we know that 
every even integer from 2 to p, — 3 can be expressed as a difference of two primes 
not greater than p,. In order to check whether the next prime p,,, is also a 
cluster prime, we need examine only the even numbers p, — 1 through p,,, — 3. 
If p41 —P, = 2, 4, or 6, then there is nothing to check; p,,, is the next cluster 
prime. If p,,, —p, = 8, then p,,, 1S a non-cluster prime, since p,,, — 9 cannot 
be a difference of two smaller primes. In this case we examine the even numbers 
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Pn+1 — t, where t is an odd composite less than or equal to p,,,; — p, + 1. For 
each such ¢ we look ahead in the sequence of primes {p,,,,.;},-; until qd’ =¢ + 
Pn+m+1 —Pn+1, turns out to be prime. Here lies the significance of little m (as 
opposed to capital M) in the algorithm: p,,,,,, is the first prime for which the 
particular even number p,,, — ¢t can be written as a difference p,.,,,, —q' of 
primes; so we are at least m primes away from the next cluster prime at this stage 
of the algorithm. Capital M is the maximum number of primes to the next 
(possible) cluster prime, based on all of the previous values of m found so far. 
When we move on to the next prime at the end of the outer do-loop, we decrease 
M by 1, since we are now one prime closer to the cluster prime we are seeking. We 
continue processing consecutive primes until M = 0, indicating that we have 
finally reached the next cluster prime. For example, the next prime after the 
non-cluster prime p,; = 97 is px, = 101. Since p,, — 9 = p,, — 13, 101 is the next 
cluster prime after 89. In our algorithm, we need a list of the prime differences 
diff [nm] = p,., — p,- We do not require the actual values of the primes themselves, 
just the differences, since 


= ) diff[n + i]. 

i=1 
We also need a short list, named odd—comp, of the odd composites 9, 15, 21, 25, 
27, 33, etc., as well as a look-up table to tell when a “small” odd integer is prime. 


Pn+m+1 — Pn+1 


Algorithm Find—next—cluster—prime(n, current_prime). 


M = 0; 
do 
d = diff[n]; 
if (d > 6) 
for (i = 1; odd_comp[i] <d+1;i=i+ 1) 
m = OQ; 
t = odd_comp [i]; 
repeat 
m=m+ 1; 
t=t+ diff[n + m]; 
until ¢ is prime; 
if(m>M)M=m; 
n=n+1; 
current_prime = current_prime + d; 
M=M -1; 
while (M > 0) 
return n; 


It is worth pointing out that the efficiency of this algorithm is due to the fact 
that it always looks forward and never needs to backtrack. In actual practice, the 
program spends more time sieving for the prime differences than it does running 
the algorithm. 


4. DISTRIBUTION OF CLUSTER PRIMES UP TO 10%. We encoded this algo- 
rithm in a C program and ran it on a 300 MHz Sun Ultra 2 Workstation. Our goal 
was to tabulate the cluster primes up to 10” in order to get an indication of their 
distribution. The following short table gives the number of cluster primes versus 
non-cluster primes for powers of 10. Here 7,(x), 7,(x), and 7,(x), respectively, 
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denote the number of cluster primes, non-cluster primes, and twin prime constella- 
tions less than or equal to x. Since we do not count 2 as a cluster or non-cluster 
prime, we have the equation 7.(x) + a(x) + 1 = w(x), which we can use as a 
check on the data. The values of 7,(x) were computed by Brent [1] and can.also 
be found in [2, p. 262]. The last column gives the value of a for which (4) becomes 
an equality, viz. a = log(x/7,(x))/(og log x)’. 


8 
0.69 35 
1.92 205 
4.31 1224 
8.47 8169 


15.61 58,980 

5,559,246 27.49 440,312 

49,787,721 46.98 3,424,506 

449,315,654 78.32 27,412,679 

31,914,282 — 4,086,140,530 128.03 224,376,048 
182,065,897 37,425 ,846,120 205.56 1,870,585,220 
1,061,375,739 345 004,161,099 325.05 15,834,664,872 


By x = 10% the non-cluster primes outnumber the cluster primes by a ratio of 
roughly 2 to 1. As expected, the ratio 7,(10“)/7,(10*) increases for each exponent 
k, and when we reach 10’, approximately 0.3% of the primes are cluster primes. 
We can show this behavior by using Theorem 1 together with the prime number 
theorem, which states that w(x), the number of primes not greater than x, is 
asymptotic to x/log x. Since 7,(x) is eventually less than x /(log x)’, it follows that 
the ratio 7,(x)/7(x) approaches 0 as x tends to infinity. Thus, “most” primes are 
non-cluster primes and the ratio m,(x)/a,(x) = (7(x) — D(n(x))"' — 1 must 
tend to infinity. 

It is interesting to contrast the columns for 7, and a,. The ratios 
—q(10**"')/7,(10*) seem to be approaching the limit 10. A proof of this observa- 
tion follows from the fact that a,(x) is asymptotic to w(x) and from the prime 
number theorem: 


lim 


iL Se (ye a a |, ey | AE | 
koe (10°) koe m(10*) kaw (k + 1log10/ klog10 


77,(10**") . a (10**") 10*+} / 10* 
For the cluster primes, the ratios 7,(10**!)/7,(10*) increase from 4.24 for k = 3 
to 5.83 for k = 12. It is difficult to predict a limit from such limited data. Heuristic 
considerations, however, suggest that 7,(x) has the shape x'~"“, where h(x) is a 
function whose limit tends to 0 as x goes to infinity. If this estimate is correct, then 
the ratios 7,(10**!)/,(10*) would also tend to 10, though more slowly than the 
ratios for the non-cluster primes. 

As the program ran to 10”, the largest value of M in the algorithm 
Find_next—cluster_prime was 58 and the largest value of t was 1503. The largest 
number of consecutive non-cluster primes was 10,543, found between the cluster 
primes 8,353,771,390,333 and 8,353,771,707,107. The difference between these two 
primes is 316,774, the largest difference found up to 10”. 

It is worthwhile noting that past 10°, the number of cluster primes lags behind 
the number of twin primes. For x = 10’* the number of twin primes is roughly ten 
times larger than the number of cluster primes. To explain this phenomenon, put 
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s = 1 and d, = 2 into Lemma 2 (Brun’s sieve) to get the upper bound 


< ——. 
m1(2) « 7 (5) 
Brun used this estimate in 1921 to show that the sum of the reciprocals of the twin 
primes converges; the proof is essentially the same as our proof of Theorem 2. 
Comparing (5) with Theorem 1’s estimate 7.(x) <x/(log x)* for any positive 
integer s, we would expect the cluster primes to be rarer than the twin primes. In 
the interest of honesty, however, we must admit two facts. First, the estimate in (2) 
for 7,(x) holds for x => x,(s). On examining the proof of Theorem 1, the value of 
x(s) is roughly x)(s) = e”, where t = e*’. For s = 3, this bound is x, =e° , a 
number with approximately 1.87 x 10° decimal digits. It seems a bit presumptu- 
ous to think that we are seeing the effects of Theorem 1 with s =3 for the 
comparatively small 13 digit numbers. The second remark is that although upper 
bounds may indicate what happens, they are not conclusive. For all we know, the 
number of cluster primes and twin primes could both be finite. In 1922 Hardy and 
Littlewood conjectured that 7r,(x) is asymptotic to 


Ee) ae x & 
AG ea) [ ——— = 1320323632 [/ —. 
p23 (p— 1) 72 (log x) 2 (log x) 


This famous conjecture has been shown to be remarkably accurate in estimating 
the number of twin prime constellations [3, p. 66], and strengthens our belief that 
m7 (x)arrowo with x. That 7,(x) tends to infinity seems harder to prove. 
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NOTES 


Edited by Jimmie D. Lawson and William Adkins 


Fermat’s Last Theorem for Gaussian 
Integer Exponents 


John A. Zuehlke 


In this note, we observe that Wiles’ Theorem [2] on the impossibility of 
xn + yi = 7f 


for x, y, Z positive rational numbers with integer exponents n # +1, +2 can be 
generalized to the case of Gaussian integer exponents v = n + im without addi- 
tional exceptions. The proof uses the Gelfond-Schneider Theorem [1], according to 
which a? is transcendental for 8 algebraic but not rational and a algebraic 
#Q,1. : 

The proof almost fits into the margin. In fact, from 


x” +y"=Zz",with v=nt+im, m#0 
it follows by taking the complex modulus squared that 
/ x*" + 2x"y" cos 0+ y*" =z°", with 6 = m log(x/y), 
so cos @ is rational. Since, for any real number 6 whatsoever there is the identity 
e748 _ Icos 6e'9 + 1 = 0, 
it follows for the particular @ that 
el? = (x/y)™ 


is algebraic. Then the Gelfond-Schneider Theorem, with a =x/y and B=im 
forces x = y. Therefore 


(27x) = 2; 
forcing z =x similarly, contradicting y # 0. 


We remark that the generalization holds, with the same proof, for exponents 
vy =n + im, with n an integer and m a real algebraic number. 
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On Rational Function Approximations 
to Square Roots 


M. J. Jamieson 


Interest in methods for calculating square roots exists partly because the speed 
with which a computer can evaluate them contributes a measure of its overall 
performance [1]. Newton’s well known method for improving estimates of a square 
root uses a simple rational function approximation and converges in second order; 
an iterative method converges in second order if it generates a sequence {s,} that 
tends to limit s and, in which the error (s,,, — s) tends to K(s, — s)* for some K, 
independent of n, as n tends to infinity. This note presents a rational function 
approximation method with faster convergence. It uses a result of Frank [2] on 
periodic continued fractions; a formula given in the 15th century by Al-Kalsadi 
[3, p. 111] is also based on continued fractions and is a special case. 

Frank studied properties of the convergents (or approximants) to the pure 
periodic continued fraction representing the quadratic surd 


ae 1 1 1 
L+VC = [b,by, ae as a ee ee (1) 


where L and C are rational numbers with C positive, the b’s are positive integers, 

p is the period and the overbar indicates the periodic part of the continued 
fraction. There are restrictions on the values of L and C in order that the 
representing continued fraction be pure periodic [4, p. 101], but they are satisfied if 
L is zero and C is an integer (when b, = 2bo, although this is not important here). 
A special case of a property given by Frank ([2, (2.2)] with s = 1, C = N, L = 0) is 


Xia; = (4;%, + N)/(%; + %;) for i,j > 0, (2) 
where x; is the (pi — 1)th convergent to the continued fraction. It can be shown by 
induction, from Pascal’s triangle rule for binomial coefficients a that the 
convergents satisfy 

x, =F,(x,) fork > 0, (3) 
where, if k is odd (= 2m + 1), 


= i k K—-2ingi + v k k-2i-1n7i 
Fe) ph ah a pa i" 
and, if k is even (= 2m), 
i=m k t=m—1 k 
= k-2ingi = k-2i-1yzi 

F,(x) i Pee N py fea tcye NN’. (4) 
The function F,(x) has a fixed point at YN. Equation (3) gives x, in terms of xj. 
By considering the period of the continued fraction to be the multiple k”p of p 
instead of p itself we obtain the same formula giving x,.+1 in terms of X;,.. 
Thus function (4) generates a series of approximations to VN which form a 
subsequence of the convergents to the continued fraction if one starts with x = x,. 
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We can use function (4) in an iterative scheme for finding the square root of an 
integer. To find the square root of a rational number g/r, say, we calculate Var 
and divide by r. 

The theory of continued fractions guarantees convergence (unless k = 1) but we 
must know the value of the (p — 1)th convergent to start the iterative sequence. 
This is inconvenient. However, by the following theorem, convergence is also 
guaranteed if we start with an arbitrary positive value x. 


Theorem. For x > 0, y, = Fen(x) > VN. 


We generalize (2) and (3) and replace (2) by 
Faj(x) = [R(X B(x) +N] + (R(x) + F()] foring>0. (5) 
From (3), (4), and (5) we find 
F(x) - ¥N =(x- UN )| Fy_3(4) — VN | +[x+F,_,(x)] fork > 1. (6) 


From the definition of F,(x), if x is positive so is F,(x) for any k. By induction 
from (3), (4), and (6) we find 


F.(x)>=VN if x=VN, (7a) 
F(x) <x if x>VN. (7b) 


These inequalities imply that repeated application of the function F,(x) gener- 
ates a strictly monotonic decreasing sequence whose greatest lower bound is YN if 
the starting value exceeds YN. Hence convergence is guaranteed for any starting 
value exceeding YN. An argument similar to that leading to inequality (7a) shows 
that 


F(x) =VN_ if x < VN fork even, (8a) 
F,(x) <VN if x < VN for k odd. (8b) 


For even k and starting value smaller that YN the first generated value exceeds VN 
and convergence is guaranteed by the argument of the preceding paragraph. If k is 
odd it can be shown from (6) and (8) that the function F,(x) is strictly monotonic 
increasing with least upper bound equal to YN for x < VN. Thus convergence is 
guaranteed for any positive starting value. Equation (6) shows that the convergence 
is of order k. 

The first two estimates of V2 (1.414213562) obtained with 


F,(x) = (x* + 6x*N + N*)/(4x° + 4xN) (9) 


and starting value 1 are 17/12 = 1.416666667 and 665857 /470832 = 1.414213562, 
rounded to nine decimal places; convergence is rapid. 
The Newton method is 


F,(x) = (4° +N) /2x. (10) 
The approximation of Al-Kalsadi is 
V(a* +b) = (4a° + 3ab) /(4a? + b), (11) 


which is F;(a) with N replaced by a* + b; with starting value 1 (a = 1) the first 
estimate of V2 is 7/5 = 1.4. 
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A Note on Jacobi Symbols 
and Continued Fractions 


A. J. van der Poorten and P. G. Walsh 


1. INTRODUCTION. It is well known that the continued fraction expansion of a 
real quadratic irrational is periodic. Here we relate the expansion for vrs , under 
the assumption that rX* — sY* = +1 has a solution in integers X and Y, to that 


of yr/s and to the Jacobi symbols (7), which appear in the theory of quadratic 


residues. 

We have endeavoured to make our remarks self-contained to the extent of 
providing a brief reminder of the background theory together with a cursory sketch 
of the proofs of the critical assertions. For extensive detail the reader can refer to 
[5], the bible of the subject. The introductory remarks following in Sections 2-3 
below are inter alia detailed in [1]. 

Let p and q denote distinct odd primes. In [3], Friesen proved connections 


between the value of the Legendre symbol (2| and the length of the period of the 
continued fraction expansion of ypq. These results, together with those of 


Schinzel in [6], provided a solution to a conjecture of Chowla and Chowla in [2]. 
We report a generalization of those results to the evaluation of Jacobi symbols 


( , and, in the context of there being a solution in integers X, Y to the equation 


rX* — sY* = +1, remark on the continued fraction expansion of yr/s vis a vis 
that of vrs. 


Theorem 1. Let r and s be squarefree positive integers with r > s > 1, such that the 
equation rX* — sY* = +1 has a solution in positive integers X, Y. Suppose the 
continued fraction expansion of Vrs is [ay,a,,@ ,...,a)]. Then both the length of the 
period | = 2h, and the ‘central’ partial quotient a,, are even, and the continued 


fraction expansion of yr/s is 


1 ae bel! 
Eerie er ieee | =F Ee ee ee re eee 
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Theorem 2. Let r and s be squarefree positive integers with r > s > 1, such that the 
equation rX* — sY* = +1 has a solution in positive integers X, Y. Denote by I the 
length of the period of the continued fraction expansion of Vrs. Then the following 
Jacobi symbol equalities hold: 


r =1\7*! s —1)\2 
err « Geter 
As an immediate consequence we obtain the following results, which respectively 
appeared as Theorem 2 and Theorem 5 in [3]. 


Corollary 1. Let p = gq = 3 (mod 4) be distinct primes and set N = pq. Denote by | 
the length of the period of the continued fraction expansion of VN . Then 1 is even, and 


5) 


where e = 1ifp<qande= —-lifp>d. 


Corollary 2. Let p = 3 (mod 8) and q =7 (mod 8) be primes and set N = 2 pq. 
Denote by I the length of the period of the continued fraction expansion of VN . Then I 


is even, and 
q 


where €= 1 if 2p <qande= —-1if 2p>q. 


2. CONTINUED FRACTIONS. In this section we recall some basic facts about 
continued fractions that will be appealed to in the proof of our results. 

Given an irrational number a, define its sequence (a;);. 9 of complete quotients by 
setting a) =a, and a,,, =1/(a; —a,). Here, the sequence (a;),, of partial 
quotients of a is given by a; =|a;| where |:| denotes the integer part of its 
argument. Plainly we have 


os rr a 
2 a. a ° . 
It is only the partial quotients that matter, so such a continued fraction expansion 
may be conveniently denoted just by [a, a,, a5, a3,...]. 

The truncations [a,),@,,...,a;] plainly are rational numbers p,/q;. Here, the 
pairs of relatively prime integers p,, q; are given by the matrix identities 


a) l\fa, 1) fa; 1) _ 
1 O/\1 O 1 0 
and the remark that the empty matrix product is the identity matrix. The alleged 


correspondence, whereby matrix products provide the convergents p,/q;, may be 
confirmed by induction on the number of matrices on noticing the definition 


Pi Pi-i 
qi Wi-1 


[@),@,,..-,4;] =a, + 1/|a,,...,@;], [ay] = ap. 
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Incidentally, it clearly follows from transposing the matrix correspondence that 


Gps@y 3002387 HO ge45 Tort = 2s (1) 
The matrix correspondence entails p;/q; = p;-1/qi-1 + (—D"'/qi-14; whence, 
by induction, a = ay + L7_,(-1)'"'/q;_,4,;, and so 


0< (-1)" "(qe — Pi) <1/4i41> 
displaying the excellent quality of approximation to a provided by its convergents. 


Conversely, if 
Iga — p| <1/24q, (2) 
then the rational p/q must be a convergent to a. | 


3. CONTINUED FRACTIONS OF SQUARE ROOTS OF RATIONALS. Ifa 
= yN, for positive integer N not a square, it is well known and easy to confirm by 
induction that its complete quotients a, are all of the shape a, = (P, + VN)/Q,, 
with the sequences of integers (P,) and (Q,) given sequentially by P..,+P.=a,Q,, 
and Q,,,0,=N-—P?.,, where a,=VN entails P) =0 and Q, = 1. Plainly, 
always P* = N (mod Q,). Moreover, it is easy to see that the integers P, all satisfy 
0 <P,< VN and the positive integers Q, are all less than 2VN. It follows by the 
box principle that the continued fraction expansion of YN must be periodic. Much 
more is fairly clear. 

First, note that the generic sep in the continued fraction algorithm for a = VN 


is a, = (P, + VN)/QO, =a, — — YN )/Q,. Under conjugation VN > — VN, 
this step transforms to 
(Pra + VN )/Q; = OF (P; _ VN )/Q;. (3) 


But the 0-th step, ingeniously adjusted by adding a, = P,, 
a) + VN = 2a, — (a) — VN) 

is plainly invariant under conjugation. Moreover, because —1 < P, — VN < 0 we 
have (P, + VN)/Q, > 1. On the other hand P, + VN > 1 of course entails 
-1< (Es VN )/Q, <0. It’s now easy to see, by induction on i, that in (3) 
—1<(P,- VN)/Q, < 0. So a, is the integer part of (P.,, + WNY/Q, and (3) is a 
step in the continued fraction expansion of a, + VN, and thus of VN. 

It follows that the sequence of steps detailing the continued fraction expansion 
of a) + VN is inverted by conjugation, that since it has a fixed point the entire 
tableaux must be periodic, and that, with / the length of the period, we must have 


ag + VN = [2a Gi Go scuasnsil (4) 


moreover with the word a,a, -:: a,;_, a palindrome. 


Lemma. The symmetry just mentioned entails that for even period length I there is a 
‘central’ step, ath = 5l, 


a, = (P, + YN)/Q, = an - (Pray — VN )/Q,, 


invariant under conjugation. So P,,, = P,, anda, = 2P,/Q,,. It follows that Q,|2N. 
Conversely, if N is squarefree and Q,|2.N, where j # 0, then | is even and 2j = 1. 


Proof: Plainly Q,|N — P? and Q,|2P, entails O,|2.N. As regards the converse, it 
suffices to notice that Q)|N — Pp? and Q;|2N implies Q, 2PF. The only Bn 
square factor of Q; Is 4, since Ni is squarefree andQ, |2.N, so it follows that Q; 2 P. 

say 2P./Q; = a,. Thus, a; = CE at VN )/Q; = =a, 2 (P= VN)/Q, is a step in the 
continued fraction expansion of '/N N invariant under conjugation. it therefore must 
be the central such step, and this is what we were to show. mi 
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Again by induction, or otherwise, one can confirm that 


Py Di-r\{1 Pia, P, NG; 
Gd; G-1})\9 Qiay qi Pi 


which entails in particular that p? — Ng? = (-1)'t'Q,,,. In other words, the Q,,, 
arise from the convergents as just indicated. 

Conversely, one sees that when x? — Ny? =¢ with |t|< VN then, if ¢> 0, 
Ix/y — VN| < 1/2y?, whilst if t < 0 then |y/x — 1/ VN| < 1/2x?. In either case 
it follows from the remark following (2) that x/y is a convergent to VN, whence t¢ 
is (—1)'*'Q,,,, some i. 


? 


4. PROOF OF THE THEOREMS 


Proof of Theorem 1. Set N = rs. By the definitions of the sequences (P,) and (Q,) 
we have 


(P, + VN )/Q, = [4),4)41,4,495---] = Dis Opisweee Opie @rceuaAel: 


Let CX, Y) be a positive integer solution to rX* — sY* = +1. Then (sY)? — NX? 
= ¥5, so because s < YN it follows that sY/X is a convergent to yrs, and, more 
to the point, s is some Q, for VN. Since, trivially, s|N = rs, we see that the lemma 
entails that i = 31 = h, whence sY/X = p,_,/4q;_1- 

Here Q, = 5 is squarefree by hypothesis and, since now it divides N, the 
argument given at the lemma entails that Q,|P,. Thus P,/Q, = 4a, is an integer, 
and so 


vrs /O, = yr/s = EZ eee 


Finally, our remark at (1), or, if one prefers, the observation at (4) that the word 
a,a, ‘+ a;_, 1s a palindrome, provides the given formulation of the expansion. Mm 


Proof of Theorem 2. We saw in the proof of Theorem 1 that the data entails 
SY/X = Py_1/Gp_-1- Thus 


pi. — Nqp-1 =(-1)"Q, is (sY)° — nX? = (-1)'s, 


and so sY* — rX” = (—1)", from which the desired conclusions follow. a 
We now establish the proofs of the corollaries. 


Proof of Corollary 1. With N = pq divisible by a prime congruent to 3 modulo 4, 
it is plain that U* — NV* = —1 has no solution in nonzero integers U,V. Thus 
the period of Vpq has even length / = 2h, say. Hence there is a solution in rela- 
tively prime integers x, y for x* — pgy* = +Q, with some Q, dividing 2 pq, and 
1<Q, < 2ypq. 

However, it is plain that x? — pgy* = 2 (mod 4) is impossible so Q, must be one 
of p or qg; say Q, =q. But x* — pgy* = ¥q implies x = gY, y =X, giving a 
solution in integers X, Y to pX* — gY* = +1, satisfying the conditions of 
Theorem 2. | 


Proof of Corollary 2. As in the proof of Corollary 1, U* — 2pqV* = -1 is 
impossible in nonzero integers U,V, so there is a solution in relatively prime 
integers x, y for x* — 2pqy? = +Q,, for some Q, dividing 4pq, and 1 < Q, < 
2V2pq. 
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It’s easy to see that the possibilities modulo 8 are x? — 2pqy? = +2p or 
x? — 2pqy’ = +q and that either yields integers X,Y satisfying 2pX* — qY* = 
+1. Thus again the hypotheses of Theorem 2 are satisfied, and the result follows 


by noticing that the Jacobi symbol 2} = 1 for g = 7 (mod 8). i 


5. CLOSING REMARKS. Suppose we know both that 
r/s = | ais Gy swans GAs coen ep | and yrs = ce eee eee 


The two expansions have the same ‘tail’, that is, they differ only in a finite number 
of initial partial quotients. Thus the numbers vrs and fr/s are equivalent and one 
sees, for example from the matrix correspondence, that there are integers X, Y, U, 
and V satisfying VX — UY = +1 and so that (U~r/s + BY Xyr/s + Y) = vrs. 
But, removing the surd from the denominator yields 
(rUX — sVY) + (VX — UY) vrs 
rX? — sY? 
It follows that rUX — sVY = 0 and, this is the point, rX¥* — sY? = +1. So the 
shape of the two continued fraction expansions, and first principles, shows that 
there is a solution in integers X,Y to rX? — sY* = +1. 

We might also recall a cute result mentioned by Nagell [4]. Namely, given an 
integer N, consider the collection of all equations aX* — bY* = +1 with integers 
a and b so that ab = N. Nagell’s remark is that at most two of that collection of 
diophantine equations can have a solution. One of us happened to have been 
reminded of this fine fact by Dmitri Mit’kin at a meeting at Minsk, Belarus in 1996. 


= vrs . 


Proof: The cases N < 0 or N = O are uninteresting and trivial, so we suppose 
that N is positive and is not a square. Then we have at least one equation with a 
solution, namely 1-X* — NY? = 1. Further, if the length / of the period of VN is 
odd then also NX? — 1-Y* = 1 has a solution. If there is some other equation 
with a solution, say aX* — bY* = +1 with a > b> 1, then, as we saw above, 
(bY)? — NX* = ¥bsol = 2h, b = Q,, and #1 = (—1)"*'. Thus there is at most 
one ‘other’ equation, and if it has a solution then / is not odd. | 
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THE EVOLUTION OF... 


Edited by Abe Shenitzer 
Mathematics, York University, North York, Ontario M3J 1P3, Canada 


The Birth of Literal Algebra 


I. G. Bashmakova and G. S. Smirnova 
Translated from the Russian by Abe Shenitzer 


1. MATHEMATICS IN THE FIRST CENTURIES AD. DIOPHANTUS. The Baby- 
lonians developed a kind of numerical algebra. Then came Greek geometric 
algebra. 

The third—very important—stage of the development of algebra began in the 
first centuries AD and came to an end at the turn of the 17th century. Its 
beginning was marked by the introduction of literal symbolism by Diophantus of 
Alexandria and its end, by the creation of literal calculus in the works of Viéte and 
Descartes. It was then that algebra acquired its own distinctive language, which we 
use today. 

The first century BC was a period of Roman conquests and of Roman civil wars. 
Both took place in the territories of the Hellenistic states and the Roman 
provinces and were accompanied by physical and economic devastation. One after 
another, these states lost their independence. The last to fall was Egypt (30 BC). 
The horrors of war and the loss of faith in a secure tomorrow promoted the spread 
of religious and mystical teachings and undermined interest in the exact sciences, 
and in abstract problems in mathematics and astronomy. In Cicero’s dialogue On 
the state one of the participants proposes a discussion of why two Suns were seen 
in the sky. But the topic is rejected, for “even if we acquired profound insight into 
this matter, we would not become better or happier.” 

In the second half of the first century BC mathematical investigations came to a 
virtual halt and there was an interruption in the transmission of the scientific 
tradition. 

At the beginning of the new era, economic conditions in the Hellenistic 
countries, now turned Roman provinces, gradually improved, and there was a 
revival of literature, art, and science. In fact, the 2nd century came to be known as 
the Greek Renaissance. It was the age of writers such as Plutarch and Lucian and 
of scholars such as Claudius Ptolemy. 

Alexandria continued its role as the cultural and scientific center of antiquity 
and, in this respect, Rome was never its rival. Nor did it ever develop an interest in 
the depths of Hellenistic science. As noted by Cicero in his Tusculanae disputa- 
tiones, the Romans, unlike the Greeks, did not appreciate geometry; just as in the 
case of arithmetic, they stopped at narrow, practical knowledge of this subject. 


Translator’s note. This article is the third chapter of an essay by I. G. Bashmakova and G. S. Smirnova 
devoted to the rise and evolution of algebra. The whole essay is being translated by Abe Shenitzer and 
is being reviewed for publication by the Mathematical Association of America. 

The first two sentences of this article were added by the translator. 
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They had little regard for all of mathematics. Even accounting, surveying, and 
astronomical observations were left to the Greeks, the Syrians, and other con- 
quered nations. According to Vergil, the destiny of Romans was wise government 
of the world. 

The revival of the Alexandrian school was accompanied by a fundamental 
change of orientation of its mathematical research. During the Hellenistic period 
geometry was the foundation of Greek mathematics; algebra had not, as yet, 
become an independent science but developed within the framework of geometry, 
and even the arithmetic of whole numbers was constructed geometrically. Now 
number became the foundation. This resulted in the arithmetization of all mathe- 
matics, the elimination of geometric justifications, and the emergence and inde- 
pendent evolution of algebra. 

We encounter the return to numerical algebra already in the works of the 


outstanding mathematician, mechanician, and engineer Heron of Alexandria (ist . 


century AD). In his books Metrica, Geometrica, and others, books that resemble in 
many respects our handbooks for engineers, one finds rules for the computation of 
areas and volumes, solutions of numerical quadratic equations, and interesting 
problems that reduce to indeterminate equations. In particular, they contain the 
famous “Heron formula” for the computation of the area of a triangle given its 


sides a,b,c: 
S= yp(p—a)(p— b)\(p-—¢), 


where p = (a+b6+c)/2. Here the expression under the square root sign is a 
product of four segments, and thus an expression totally inadmissible in geometric 
algebra. It is clear that Heron thought of segments as numbers, whose products are 
likewise numbers. 

In his famous book, known under its Arabized name Almagest, Claudius 
Ptolemy, when computing tables of chords, identified ratios of magnitudes with 
numbers, and the operation of “composition ” of ratios—defined in Euclid’s 
Elements—with ordinary multiplication. | 

The new tendencies found their clearest expression in the works of Diophantus 
of Alexandria, who founded two disciplines: algebra and Diophantine analysis. 

We know next to nothing about Diophantus himself. On the basis of certain 
indirect remarks, Paul Tannery, the eminent French historian of mathematics, 
concluded that Diophantus lived in the middle of the 3rd century AD. On the 
other hand, Renaissance scholars who discovered Diophantus’ works, supposed 
that he lived at the time of Antoninus Pius, i.e., approximately in the middle of the 
2nd century. The epigram in Anthologia Palatina provides the following informa- 
tion: “Here you see the tomb containing the remains of Diophantus, it is remark- 
able: artfully it tells the measures of his life. God granted him to be a boy for the 
sixth part of his life, and adding a twelfth part to this, He clothed his cheeks with 
down; He lit him the light of wedlock after a seventh part, and five years after his 
marriage He granted him a son. Alas! late-born wretched child; after attaining the 
measure of half his father’s life, chill Fate took him. After consoling his grief by 
this science of numbers for four years he ended his life. By this device of numbers 
tell us the extent of his life.” A simple computation shows that Diophantus died at 
the age of 84 years. This is all we know about him. | 


2. DIOPHANTUS’ Arithmetica. ITS DOMAIN OF NUMBERS AND SYMBOL- 


ISM. Only two (incomplete) works of Diophantus have come down to us. One is 
his Arithmetica (six books out of thirteen; four more books in Arabic, attributed to 
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Diophantus, were found in 1973. They will be discussed in the sequel). The other is 
a collection of excerpts from his treatise On polygonal numbers. We discuss only the 
first of these works. 

Arithmetica is not a theoretical work resembling Euclid’s Elements or Apollo- 
nius’ Conic sections but a collection of (189) problems, each of which is provided 
with one or more solutions and with relevant explanations. At the beginning of the 
‘first book there is a short algebraic introduction, which is basically the first account 
of the foundations of algebra. Here the author constructs the field of rational 
numbers, introduces literal symbolism, and gives rules for operating with polynomi- 
als and equations. 

Already Heron regarded positive rational numbers as legitimate numbers (in 
classical ancient mathematics “number” denoted a collection of units, ie., a 
natural number). While Diophantus defined a number as a collection of units, 
throughout Arithmetica he called every positive rational solution of one of his 
problems “number” (a p.Ouds), i.c., he extended the notion of number to all of 
Q*. But this was not good enough for the purposes of algebra, and so Diophantus 
took the next decisive step of introducing negative numbers. It was only then that 
he obtained a system closed under the four operations of algebra, 1.e., a field. 

How did Diophantus introduce these new objects? Today we would say that he 
used the axiomatic method: he introduced a new object called “deficiency” (Aetws, 
from AEet7w—to lack) and stated rules for operating with it. He writes: “deficiency 
multiplied by deficiency yields availability (i.e., a positive number (the authors)); 
deficiency multiplied by availability yields deficiency; and the symbol for deficiency 
is cy, an inverted and shortened (letter) yw” (Diophantus. Arithmetica. Definition 
IX). In other words, he formulated the rule of signs, which we can write as follows: 

= ar es 
(Cra): 

Diophantus did not formulate rules for addition and subtraction of the new 
numbers but he used them extensively in his books. Thus, while solving problem 
III, (i.e., Problem 8 in Book IID, he needs to subtract 2x + 7 from x* + 4x + 1. 
The result is x* + 2x — 6, ie., here he carries out the operation 1 — 7 = —6. In 
problem Vhia 90 — 15x? is subtracted from 54 and the result is 15x* — 36. Thus 
here 15x* is subtracted from zero; in other words, Diophantus is using the rule 
Se ee r 

We note that Diophantus used negative numbers only in intermediate computa- 
tions and sought solutions only in the domain of positive rational numbers. A 
similar situation developed later in connection with the introduction of complex 
numbers. Initially they wets regarded as just convenient symbols for obtaining 
results involving “genuine,” 1.e., real, numbers. 

Diophantus also introduced literal signs for an unknown and its powers. He 
called an unknown a “number” (a p.@uds) and denoted it by the special symbol s. 
It is possible that this symbol was introduced before him. We find it in the 
Michigan papyrus (2nd century AD) as well as in a table appended to Heron’s 
Geometrica. But Diophantus boldly breaks with geometric algebra by introducing 
special symbols for the first six positive powers of the unknown, the first six negative 
powers, and for its zeroth power. While the square and cube of the unknown could 
be interpreted geometrically, its 4th, 5th, and 6th powers could not be so repre- 
sented. Nor could the negative powers of the unknown. 

Diophantus denoted the positive powers of the unknown as follows: 


x—s; x7? AY; x°9—K*; x7 AYA; 4° AK”; x°—K’K. 
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He defined negative powers as inverses of the corresponding positive powers 
and denoted them by adding to the exponents of the positive powers the symbol y. 
For example, he denoted x~* = 1/x? by A™. : 

He denoted the zeroth power of the unknown by the symbol M, that is by the 
first two letters in MOvas, or unity. 

Then he set down a “multiplication table’ for powers of the unknown that can 
be briefly written as follows: 


x™x"™ =x™"" -6<min<6. 


He singled out two rules that correspond to the two basic axioms that we use for 
defining a group: | 


x™-1=x™ (definition VII); (1) 
x™x"™ = 1 (definition VI). (2) 


In addition, Diophantus used the symbol to for equality, and the symbol 0 for 
an indeterminate square. All this enabled him to write equations in literal form. 
Since he did not use a symbol for addition, he first set down all positive terms, 
then the minus sign (i.e., -y), then the negative terms. For example, the equation 


x? —2x?+10x-1=5 
was written as 
K’ast h AB Maio Me. 

Here @ = 1,1 = 10, B = 2, € = 5 (we recall that the Greeks used the letters of 
the alphabet to denote numbers). 

In the “Introduction” Diophantus formulated two basic rules of transformation 
of equations: 1) the rule for transfer of a term from one side of an equation to the 
other with changed sign and 2) reduction of like terms. Later, these two rules 
became well known under their Arabized names of al — jabr and al — mugqabala. 

Diophantus also used the rule of substitution in a masterly way but never 
formulated it. 

We can say that in the introduction Diophantus defined the field Q of rational 
numbers, introduced symbols for an unknown and its powers, as well as symbols 
for equality and for negative numbers. 

Before discussing the contents of Arithmetica we consider the possibilities and 
limitations of Diophantus’ symbolism. Getting ahead of the story, we can say that, 
basically, Diophantus considered in his work indeterminate equations, i.e., equa- 
tions with two or more unknowns. But he introduced symbols for just one unknown 
and its powers. How did he proceed when solving problems? 

First he stated each problem in general form. For example: “To decompose a 
square into a sum of squares” (problem II,). Now we would write this problem as 


oy Sa". 
How could Diophantus write this equation with just one symbol for an unknown 
and without symbols for parameters (in this case a)? He proceeded as follows: 
after the general formulation he assigned concrete values to the parameters—in 
the present case he put a* = 16. Then he denoted one unknown by his special 
symbol (we will use the letter ¢ instead) and expressed the remaining unknowns as 
linear, quadratic, or more complex rational functions of that unknown and of the 
parameters. In case of the present example, one unknown is denoted by ¢ and the 
other by At — a or, as Diophantus puts it, “a certain number of ¢’s minus as many 


60 THE EVOLUTION OF... [January 


units as are contained in the side of 16,’’i.e., instead of a he takes 4 and instead of 
the parameter k—the number 2. But by saying “a certain number of ?’s” he 
indicates that the number 2 plays the role of an arbitrary parameter. Thus 
Diophantus’ version of our equation is 


t? + (2t — 4)° = 16, 
so that 
x=t=16/5; y=2t-—4=12/5. 

One might think that Diophantus was satisfied with finding a single solution. 
But this is not so. In the process of solving problem III,, he finds it necessary to 
decompose a square into two squares. In this connection he writes: “We know that 
a square can be decomposed into a sum of squares in infinitely many ways.” 

The use of a concrete number to denote an arbitrary parameter has the virtue 
of simplicity. Sometimes it turned out that the parameter could not be selected 
arbitrarily, that it had to satisfy additional conditions. In such cases Diophantus 
determined these conditions. Thus problem VI, reduces to the system 


xpt+x,=y, x, +x, =y. 
Diophantus puts x, = t, x; = Bt, where B = 2. Then from the second equation we 
obtain y = (6 + 1)t, and from the first 
1 
— ae Eee ? 
C81) =p" | 

Since B = 2, t? = 1/19, ie., t is not rational. In order to obtain a rational 

solution Diophantus looks at the way t* is expressed in terms of the parameter B. 


The expression in question is a fraction whose numerator, 1, is a square. But then 
the denominator must also be a square: 


(B+1)-pe=0. 
Diophantus took as the new unknown B = 7 (he denoted it by the same symbol as 
the original unknown x,) and obtained 


t2 


37° +37+1=0. 


Solving this equation by his method (which we will describe in detail in the next 
section) Diophantus obtained 

a 2A 

Nv — 3° 
i.e., the parameter could only be chosen from the class of numbers {(3 + 2A)/ 
(A* — 3)}. Diophantus takes A = 2 and obtains 6 = 7. Then he goes back to 
solving the original problem. 

Diophantus often deliberately chooses for parameters numbers that do not lead 
to solutions. He does this in order to show how to analyze problems. 

Thus concrete numbers play two roles in Arithmetica. One role is that of 
ordinary numbers and the other is that of symbols for arbitrary parameters. 
Numbers were destined to play the latter role almost to the end of the 16th 
century. 

Time to sum up. Diophantus was first to reduce determinate and indeterminate 
problems to equations. We may say that for a large class of problems of arithmetic 
and algebra he did the same thing that Descartes was later to do for problems of 
geometry, namely he reduced them to setting up and solving algebraic equations. 


1999] THE EVOLUTION OF... 61 


Indeed, in order to solve problems—arithmetical in the case of Diophantus and 
geometric in the case of Descartes—both set up algebraic equations that they 
subsequently transformed and solved in accordance with the rules of algebra. Also, 
the transformations involved— such as elimination of unknowns, substitutions, and 
reduction of similar terms—had no direct arithmetic or geometric significance and 
were not subject to extensive interpretations. In both cases such interpretations 
were reserved for the final results. We are used to associating this important step 
with Descartes’ creation of analytic geometry, but long before Descartes this step 
was taken by Diophantus in his Arithmetica. Before exposure to Arithmetica, none 
of the scholars of the period between the 13th and 16th centuries entertained the 
idea of applying algebra to the solution of number-theoretic problems. 


3. THE CONTENTS OF Arithmetica. DIOPHANTUS’ METHODS. We have al- 
ready mentioned that Arithmetica is a collection of problems with solutions. This 
may create the impression that it is not a theoretical work. But a more careful 
reading makes it clear that the purpose of the painstaking choice and deliberate 
placement of problems was to illustrate the application of specific general meth- 
ods. It is a characteristic of ancient mathematics that methods were not formulated 
apart from problems but were disclosed in the process of their solution. We recall 
that the famous “method of exhaustion’—the first variant of the theory of 
limits—was not set down in pure form either by its author Eudoxus of Cnidus or 
by Archimedes. It was mathematicians of the 16th and 17th centuries who isolated 
it by analyzing Euclid’s Elements and Archimedes’ quadratures and formulated it 
in general terms. The same applies to Diophantus’ Arithmetica. As we show in the 
sequel, his methods were isolated in the 16th and 17th centuries by Italian and 
French mathematicians. Following them, we will try to isolate some of these 
methods and state them in general form. 

In Book I Diophantus solved particular determinate linear and quadratic 
equations. The remaining books deal with the solution of indeterminate equations, 
i.e., equations of the form | 


PUK awe he) a Oy. eS 2: 


F(x,,...,,) a polynomial, or of systems of such equations: 


F(X1,.--,X,) =0, kK<n. 


Diophantus looks for positive rational solutions x}, x3,...,x°, x2 € Q*, of such 
equations or of such systems. 

It is clear that to solve his determinate equations Diophantus needed only 
symbols for x and x* and not for x”, —6 <n < 6. In other words, he extended his 
domain of numbers and introduced most of his symbols to investigate and solve 
indeterminate equations, where he really needed higher powers of the unknown as 
well as its negative powers. 

Thus the birth of literal algebra was connected not with determinate but with 
indeterminate equations. 

Here we present just one of Diophantus’ methods, namely his method for 
finding the rational solutions of a quadratic equation in two unknowns: 


F(x, y) = 0, (3) 


where F(x, y) is a quadratic polynomial with rational coefficients. 


62 THE EVOLUTION OF... [January 


Basically, Diophantus proves the following theorem: if equation (3) has a 
rational solution (xy, y)) then it has infinitely many such solutions (x, y), and x 
and y are both rational functions (with rational coefficients) of a single parameter: 


x= (k), y= (k). (4) 
When presenting his methods we use modern algebraic symbolism. This is by 
now a Standard procedure in historical-mathematical literature. 
Diophantus began by considering quadratic equations of the form 
y>=ax’+bx+c, a,b,c EQ, (5) 
and put c = m* (in other words, he assumed that the equation had two rational 
solutions (0, 7) and (0, —m)). To find solutions he made the substitution 
y=kx+m (6) 
and obtained 
b+ 2km b+ 2km 
nt a 
By assigning to k all possible rational values (Diophantus took only values that 
yielded positive x and y) we obtain infinitely many solutions of equation (5). 


We note that the substitutions (6) are the famous Euler substitutions that are 
applied to integrals of the form 


+m. 


| dx 
Vaxz + bx tc. 


We mentioned earlier problem II,, which reduces to the equation 


x*+y* =a’, (7) 
and recall that Diophantus solved it by making the substitution 
x=t; y=kt—a, ; (8) 
and obtained (we are replacing his numerical values by appropriate letters) 
2k k*-1 
x=t= Tarr eae 


To see the sense of this solution and to appreciate its generality we must look at 
its geometric interpretation. Equation (7) determines a circle of radius a centered 
at the origin, and the substitution (8) is the equation of a straight line with slope k 
passing through the point A(0, —a) on that circle (Figure 1). It is clear that the 
straight line (8) intersects the circle (7) in another point B with rational coordi- 
nates. Conversely, if there is a point B, with rational coordinates (x,, y,) on the 
circle (7) then AB, is a straight line of the pencil (8) with rational slope k. Thus to 
every rational k there corresponds a rational point on the circle (7) and to every 
rational point on the circle (7) there corresponds a rational value of k. Hence 
Diophantus’ method yields all rational solutions of equation (7). 

This argument shows that a conic with a rational point is birationally equivalent 
to a rational straight line. 

Next Diophantus considered the more general case when equation (5) has a 
rational point but the coefficient c is not a square. He first considered this case in 
problem II,, which reduces to the equation 


xe +y*=a’*+bd? (9) 
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Figure 1 


(Diophantus put a = 2, b = 3). It is clear that equation (9) has the following four 
solutions: (a, b), (—a, b), (a,— b), and (—a, —b). Diophantus makes the substitu- 
tion | 

x=tt+a, y=kt—b (10) 


and obtains t = 2(bk — a)/(1 + k*). Applying a geometric interpretation analo- 
gous to the one just used we see that, essentially, he is leading a straight line with 
slope k through (a, —b) on the circle (9). 

Diophantus considered a more general case in lemma 2, proposition VI,, and in 
the lemma for proposition VI,;: assuming that equation (5) has a rational solution 
(Xo, Yo) he made the substitution x = t + x, and obtained 

y* =at? + (2ax, + b)t + yA, 
i.e., he reduced the problem to the case c = m’. 

Finally, he considered equation (5) in the case when a = a’. He made the 

substitution (easily recognized as Euler’s “second substitution” (the authors )) 


y=attk (11) 

and obtained | 

c —k? 

t= ———_.. 
2ak —b 
This case calls for a separate discussion. To understand why the straight line 
(11) intersects the conic section (5) in just one point we introduce projective 
coordinates (U,V, W) by putting x = U/W, y = V/W, i.e., we consider our conic 
in the projective plane P*. Then equation (5) takes the form 

a’U* + bUW + cW?. (12) 


The curve L so defined intersects the line at infinity W = 0 in two rational points: 
(,a,0) and (1, —a,0). The straight line (11), whose equation in projective 
coordinates is 


V=aU+kw, 
passes through the first of these points. 
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In summary, we can say that Diophantus carried out a complete investigation of 
a quadratic indeterminate equation in two unknowns. Later, his analysis served 
as a model for the investigation of the question of rational points on curves of 
genus 0. — 

Diophantus used more complex and more sophisticated methods to solve 
equations of the form 


= ax? + bx*+cx+d, 
y? =ax? + bx? +x +d, 
yr=ax'+ber tc’? +atf, 
and systems of the form 


ax? +bx+c=y’, 
a,x*+b,x4+¢, =2’, 


which he called “double equalities.” Readers interested in getting a deeper 
understanding of Diophantus’ methods should consult the book by Bashmakova 
and Slavutin: A history of Diophantine analysis from Diophantus to Fermat (Russian) 
which contains further references to the literature. The history of Diophantus’ 
methods extends all the way to the papers of Poincaré that appeared at the 
beginning of the 20th century. It was on the basis of these methods that Poincaré 
constructed an arithmetic of algebraic curves—an area of mathematics that is 
being intensively developed at the present time. 

We conclude our survey by considering Diophantus’ problem III ,o. This prob- 
lem reduces to a system of 8 equations in 12 unknowns: 


(xy +x, + x3 + i. +x; =y/, 
(X, +X, +x +x,)° —x,=27; i=1,2,3,4. 


Diophantus notes that “in every right triangle the square of the hypotenuse 
remains a square if we add to it, or subtract from it, twice the product of its legs.” 
This means that he must find four right triangles with the same hypotenuse. 
Indeed, let the sides of the four triangles be a,,b,,c, i = 1,2,3, 4. Then it suffices 
to put x, +x, +x, +x,=ct, x; = 2a,b, i= 1,...,4. Thus the problem re- 
duces to finding a number c that can be written as a sum of two squares in four 
different ways. Diophantus solves this essentially number-theoretic problem as 
follows: he takes two right triangles with sides 3, 4,5 and 5, 12, 13 respectively, and 
multiplies the sides of each of them by the hypotenuse of the other. As a result he 
obtains two right triangles with the same hypotenuse: 39, 42, 65 and 25, 60, 65. Now 
5 = 1? + 2% and 13 = 2? + 3”. Using the rule for composition of forms u* + v* 
known already to the Babylonians, namely 


(u? + v?)(a? + B?) = (au — Bv)’ + (av+ Bu)’ 
= (au + Bv)’ (av = Bu)’, 
he obtains 
65 = 5-13 = (1 + 27)(274+ 37) =44+7=8 +1. 


Using Euclid’s formulas for the general solution of x? + y? = z* (ie., z = p* + q’; 
x = p* —q*; y =2pq) we obtain two more right triangles with hypotenuse 65: 
33, 56,65 and 63, 16,65. This completes the solution of the problem. 
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In connection with this problem Fermat stated that a prime of the form 4” + 1 
could be written as a sum of squares in just one way. Then he gave a formula for 
the determination of the number of ways in which a given number can be written 
as a sum of squares. Thus problems involving indeterminate equations led to 
number-theoretic insights. 

Did Diophantus know the theorems formulated by Fermat? It is possible that 
he did. Jacobi offered a reconstruction of Diophantus’ conjectured proofs, but the 
answer to this question remains hypothetical. 

One can hardly overestimate the significance of Diophantus’ Arithmetica for the 
subsequent history of algebra. It is no exaggeration to say that its role was 
comparable to the role of Archimedes’ treatises in the history of the differential 
and integral calculus. We will see that it was the starting point for all mathemati- 
cians up to Bombelli and Viéete, and that its importance for number theory and for 
indeterminate equations can be traced up to the present. 


4. ALGEBRA AFTER DIOPHANTUS. The period from the 4th to the 6th cen- 
turies AD was marked by the precipitous decline of ancient society and learning. 
But eminent commentators, such as Theon of Alexandria (second half of the 4th 
century) and his daughter Hypatia (murdered in 418 by a fanatical Christian mob), 
were still active. In the 5th century there was an exodus of scholars from 
Alexandria to Athens. Finally, in the 6th century, Eutocius and Simplicius, the last 
of the great commentators, were expelled from Athens and settled in Persia. 

We can turn to the question of the Arabic translations of four books attributed 
to Diophantus. An analysis of these books, translated at the end of the 9th century 
from Greek to Arabic by Costa ibn Luca (i.e., the Greek Constantin, son of Luca) 
shows that they are a reworked version of Diophantus’ Arithmetica. They contain 
problems, possibly due to Diophantus, as well as extensive additions and commen- 
taries to them. According to Suidas’ Byzantine dictionary, Hypatia wrote commen- 
taries on Arithmetica. It is therefore very likely that the four books translated into 
Arabic are books edited and provided with commentaries by Hypatia. These books 
contain no new methods, but the material is presented in a complete and 
systematic manner. Their author went beyond Diophantus by introducing the 8th 
and 9th powers of the unknown. 

The subsequent development of mathematics, including that of algebra, was 
connected with the Arabic East. Scholars from Syria, Egypt, Persia, and other 
regions conquered by the Arabs wrote scientific treatises in Arabic. 
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PROBLEMS AND SOLUTIONS 


Edited by Gerald A. Edgar, Daniel H. UlIman, and Douglas B. West 


with the collaboration of Paul T. Bateman, Mario Benedicty, Paul Bracken, Duane M. Broline, Ezra 
A. Brown, Richard T. Bumby, Glenn G. Chappell, Randall Dougherty, Roger B. Eggleton, Ira M. Ges- 
sel, Bart Goddard, Jerrold R. Griggs, Douglas A. Hensley, Richard Holzsager, John R. Isbell, Robert 
Israel, Kiran S. Kedlaya, Murray S. Klamkin, Fred Kochman, Frederick W. Luttmann, Frank B. Miles, 
Richard Pfiefer, Leonard Smiley, John Henry Steelman, Kenneth Stolarsky, Richard Stong, Charles 
Vanden Eynden, and William E. Watkins. 


Proposed problems and solutions should be sent in duplicate to the MONTHLY 
problems address on the inside front cover. Submitted problems should include 
solutions and relevant references. Submitted solutions should arrive at that address 
before June 30, 1999; Additional information, such as generalizations and refer- 

ences, is welcome. The problem number and the solver’s name and address should 
appear on each solution. An acknowledgement will be sent only if a mailing label 
is provided. An asterisk (*) after the number of a problem or a part of a problem 

indicates that no solution 1s currently available. 


PROBLEMS 


10704. Proposed by Wiliam G. Spohn, Jr. Ellicott City, MD. Show that there are infinitely 
many pairs ((a, b, c), (a’, b’, c’)) of primitive Pythagorean triples such that ja — a’|, |b — 
b’|, and |c — c’| are all equal to 3 or 4. Examples include ((12,5, 13), (15, 8, 17)) and 
((77, 36, 85), (80, 39, 89)). 


10705. Proposed by D. W. Brown, Marietta, GA. A topological space has the fixed point 
property if every continuous function from the space to itself has a fixed point. Is there a 
countably infinite Hausdorff space with the fixed point property? 


10706. Proposed by James G. Propp, University of Wisconsin, Madison, WI. Given a finite 
sequence (ad), ..., Qn), define the derived sequence (by, ...,bn+\) by bj = s — aj_-) — aj, 
where s = min) <j<n41(@j-) + @j) + Max) <j<y)+1(@j-) + .;) and where we interpret both 
ao and ayj+ 1 as 0. Let So be the sequence (1) of length 1, and for n > 1 define S, to be the 
derived sequence obtained from S,_;. Thus S; = (1, 1), $2 = (2, 1, 2), $3 = (3, 2, 2, 3), 
and Sq = (5, 3, 4, 3, 5). Show that the middle term of $2, is a perfect square. 


10707. Proposed by John Isbell, State University of New York, Buffalo, N Y. Show that 
(a) no vector space over an infinite field is a finite union of proper subspaces; and 
(b) no vector space over an n-element field 1s a union of n or fewer proper subspaces. 


10708. Proposed by the Western Maryland College Problems Group, Westminster, MD. Let 
1 f7 1 1 — cos(x + f) 
= — —| —_—_—___—_——. } dt 
Pe) if t og (tO 
for x € (0, 7). 


(a) Find the Fourier sine series for f. 
(b) Find the L? norm of f. 
(c) Find lim,-.0 f(x). 
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10709. Proposed by Zoltan Sasvdari, Technical University of Dresden, Dresden, Germany. 
Let X be a standard normal random variable, and choose y > 0. Show that 


on Pri(a <X<a+t y) 2 oay+(1/2)ay? 
Pr(a—y<X <a) 


when a > 0. Show that the reversed inequalities hold when a < 0. 


10710. Proposed by Bogdan Suceava, Michigan State University, East Lansing, MI. Let 
ABC be an acute triangle with incenter J, and let D, E, and F be the points where the circle 
inscribed in ABC touches BC, CA, and AB, respectively. Let M be the intersection of the 
line through A parallel to BC and DE, and let N be the intersection of the line through 
A parallel to BC and DF. Let P and Q be the midpoints of DM and DN, respectively. 
Prove that A, E, F, 7, P, and QO are on the same circle. 


SOLUTIONS 


When O-H-I Is Isosceles 


10547 [1996, 695]. Proposed by Dan Sachelarie, ICCE Bucharest, and Vlad Sachelarie, 
University of Bucharest, Bucharest, Romania. In the triangle ABC, let O be the circum- 
center, H the orthocenter, and J the incenter. Prove that the triangle OH is isosceles if 
and only if ~ 

e+bh+c _ R 


3abc Or’ 


Solution by Walther Janous, Ursulinengymnasium, Innsbruck, Austria. We denote by MPV 
the reference D. S. Mitrinovic¢, J. E. Pecéari¢, and V. Volenec, Recent Advances in Geometric 
Inequalities, Kluwer, 1989. Neither JO nor HI is ever as large as HO [MPV, p. 288], so 
the only way triangle J H O can be isosceles is if 1O = HI. Also 10% = R* —2Rr [MPV, 
p. 279] and HI* = 4R* + 4Rr + 3r? — s* [MPV, p. 280], where s is the semiperimeter. 
Hence HI = IO if and only if R* — 2Rr = 4R? + 4Rr + 3r? — s*. This rearranges to 
2s(s*—3r?—6Rr)/12Rrs = R/2r, or, using abe = 4Rrs [MPV, p. 52] anda? +b? +c? = 
2s(s? — 3r2 — 4Rr) [MPV, p. 52], to (a? + b° +. c?)/3abe = R/2r. 


Editorial comment. Another condition equivalent to HI = IO, given in problem E2282 
[1971, 196; 1972, 397] from this MonTHLy, is that ABC has one angle equal to 60°. 
Solved also by J.. Anglesio (France), R. Barbara (Lebanon), F. Bellot Rosado (Spain), C. W. Dodge, J, S. Frame, Z. Franco, 


M. S. Klamkin (Canada), W. W. Meyer, V. Mihai (Canada), C. R. Pranesachar (India), B. Prielipp, V. Schindler (Germany), 
I. Sofair, M. Tabadé (Morocco), T. V. Trif (Romania), M. Vowe (Switzerland), GCHQ Problems Group (U. K.), and the proposers. 


The Divisible Differences Property 


10553 [1996, 809]. Proposed by Bjorn Poonen, Mathematical Sciences Research Institute, 
Berkeley, CA, Jim Propp, Massachusetts Institute of Technology, Cambridge, MA, and 
Richard Stong, Rice University, Houston, TX. Say that a sequence (q) = q1, 41, 42,... 
of integers has the divisible differences property if (n — m)|(qn — Gm) for all n and m. 

(a) Show that if (q) has the divisible differences property and lim sup |g,|!/" < e — 1, then 
there is a polynomial Q such that gq, = Q(n). 

(b) Show that there is a sequence (q) that has the divisible differences property and satisfies 
lim sup |gn|!/" < e, for which gp is not given by a polynomial in n. 

(c)* Is it true that lim sup |q,|!/" > e for all non-polynomial (q) with the divisible differences 
property? 


68 | PROBLEMS AND SOLUTIONS [January 


Solution of parts (a) and (b) by the GCHQ Problem Solving Group, Cheltenham, U. K. Let 
Ln = |lcem{1,...,n}. We need three facts. 

Fact 1: log(L,) ~ nasn —> oo. 

Proof: If p is prime and p < n, then p!!°&! is the. highest power of p that divides 
n. Therefore, Ly = lies pulcep”) 2 cp pier = gee n = n™™. Thus, 
log(L,) < a(n) logn ~ n, the latter following from the Prime Number Theorem. Con- 
versely, if 1/2 <r <1, then Ly > [[prepen P > Hnrepen”” = ni @(n)—-2("") Taking 
logarithms and using the Prime Number Theorem yields log(L,) > r(a(n)—2(n")) logn ~ 


r 


i (ts - te) logn =rn—n' ~rn. Sincer is arbitrary, log(L,) ~n. O 


Fact 2: Given qo, q1,.--, Qn—1, the choices for gy, such that (n—m)|(qn— qm) forO < m <n 
lie in the same congruence class modulo Ly. 
This is a consequence of the Chinese Remainder Theorem. 
Fact 3: If go, q1,.--,@n—1 has the divisible differences property, and if r, is the value 
found by fitting a minimum degree polynomial to go, gq, ..., Gn—1 and extrapolating, then 
rn = ye C1) ani and (n — m)|(rn — dm) forO < m <n. 
Proof: The formula for r, is just a restatement of the property that the nth difference of 
90; G1,-++» Gn—1 1S 0. We prove the rest by induction on n, it being trivial when n = 1. 
The first difference of go, G1,...,@n—1 18 a sequence of length n — 1 with the divisi- 
ble differences property, and its polynomial extrapolation is r, — gn— 1. By the inductive 
hypothesis, m|(Tn — Gn—1) — (Qn—m — Gn—m—1) for m > 0, and by the hypothesis on 
G0, G1>-++»Gn—1 We have m|(qn—1 — Gn—m—1). Hence m|(™m — qn—m), SO We May assume 
m = 0. By subtracting qo from the rest of the sequence, we may assume go = 0, so 


n—| 
'n = dv (1) ani (1) 


Now n|(n — i)(") since (n —i)(7) = ne) and (n — i)|qn—j, so n divides each term of 
the sum in (1) and therefore r,. 0 


We can restate the divisible differences property as: Lyn|(@n — rn) for all n. 
(a) If lim sup |g,|!/" < e — 1, then there is an € > O and an integer no such that |q,|!/" < 
e — 1 — 3¢€ for n > no. From this and Fact 3, there is an ny > no such that |r,| < 
din (;)(€ — 1 —2€)"! < (e — 2€)" < (e — €)"/2 for n > ny. Using Fact 1, there is an 
nz > n, such that n > n2 implies that log(L,) > nlog(e — €) (that is, L, > (e — €)"). If 
n > ny and go, q1,.--+Gn—1 has the divisible differences property, r, is thus the only value 
for gn that would extend the property and also have absolute value less than (e — €)"/2. 
Since |gn| < (e — 1 — 3e)", it follows that gn = rn. 

This is the inductive step in a proof that the minimum degree polynomial that fits 
G0, +++» ny fits gn for all n. 
(b) Set go = 0. For each positive n, set gy, to be congruent to r, mod L, but of opposite 
sign and having magnitude less than L,. If r, = 0, however, set g, = Ly. Since gy never 


equals rn, gn is not given by a polynomial. Since |gn| < |Ly|, we have lim sup |q,|!/" < e. 


Editorial comment. The results of parts (a) and (b) appear in I. Ruzsa, On congruence 
preserving functions (in Hungarian), Mat. Lapok 22 (1971) 125-134 (Math. Reviews, vol. 48, 
#2044). Part (a) generalizes problem 4 from the 1995 USA Mathematical Olympiad, which 
imposed the stronger condition that g, is bounded by a polynomial. No solutions were 
submitted for part (c). 


Parts (a) and (b) solved also by R. J. Chapman (U. K.), K. S. Kedlaya, and the proposers. 
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Mathematics for Hikers 


10557 [1996, 902]. Proposed by Nick MacKinnon, Winchester College, Winchester, U. K. 
Naismith’s rule allows walkers to compute the time for their journeys. The time is given by 
allowing a walking speed of 4 km/hr, but adding an extra minute for each 10m of ascent. 
A conical mountain has base radius 1650m and vertical height 520m. Points A and B 
are diametrically opposite at the base of the mountain. How should a path be constructed 
between between A and B on the surface of the mountain that minimizes the time taken to 
walk from A to B? 


Solution by the proposer. The surface of the 
mountain can be unrolled isometrically 
onto a plane, forming a sector (of angle 
2a = 165/173 - 27 radians) of a circle 


(a — 20)l cosé 


(of radius / = 1730m, the slant height — A 
of the mountain), as shown in the 
figure at right. A’ 


Call the time-minimising path the 
Naismith geodesic for the cone. This 
geodesic must reach some maximum height 
h. The figure shows a potential Naismith 
geodesic with maximum height h = 520(1 — cos @) 
meters. It is composed of a circular arc following the contour with height h, together with 
tangents joining the arc to A and B. No alternative path taking less time attains the maximum 
height h of the given path since such a path must at least touch the circular arc, must not 
cross the circular arc, and must leave the circular arc without subsequent reascent. The 
Naismith geodesic must therefore be a path of the given shape. The length of such a path is 
21 sin @ + (a — 20)1 cos 0, and Naismith’s rule gives a time of 


t(@) = 0.015(2/ sin @ + (@ — 20)l cos 8) + 5201 — cos 8) /10 


minutes for the journey. The lone critical value of t(@) for O < 6 < @/2 occurs when 
t'(0) = (52 — 0.015(a@ — 26)1) siné6 = 0 at 6* = (a — 52/(0.0151)) /2 = .49623 radians. 
This gives the optimal time t(0*) = 76.71037 minutes. A path around the bottom of the cone 
takes t(0) = 77.75442 minutes, while the path of shortest distance takes t(a@/2) = 99.98929 
minutes. 


Solved also by J. Anglesio (France), J. E. Dawson (Australia), P. G. Kirmser, J. H. Lindsey II, R. Reynolds & M. Martinez, and 
P. Strafflin. 


A Matter of Adjustment 


10558 [1996, 902]. Proposed by Zhang Chengyu, Hubei University, Wuhan, China. Let 
p be a prime, and let k be a positive integer. Let aj,a2,...,a,« be any p* integers. 


We define the adjustment of these integers to be the p* integers by, b2,..., b,x, where 


pe» 
bj = aj41 + aj+2 +++: +aj+p, interpreting subscripts modulo p*. For example, if p = 2 
and k = 2, one adjustment of 1, 1, 3, 4 gives 4, 7,5, 2. Prove that after p* adjustments of 


A}, 42,...,Apk, the list consists entirely of integers divisible by p. 


Solution I by Thomas Jager, Calvin College, Grand Rapids, MI. We prove a stronger state- 
ment. Given an integer vector v = (vj,..., Uyk)s define the v-adjustment of (a1, ..., Anh)” 


to be (bj,..., bx)", where bj = vjaj41 + v2aj42 + +++ + Unk Aj 4. pk again treating sub- 

scripts modulo p*. As a transformation, the v-adjustment is represented by the matrix 
k F : F , : 

A = vjS + 28% +--+ uv, S? , where S is the permutation matrix for a cyclic shift by 
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one position. Hence 
nk nk pe yk , ae ae. 
A = (v1S +--+ v5 ) Su [+e +uy f= (Ui +-++ + vpe) I (mod p). 


Thus if vj +--+ + u,k = 0 modulo p, then p* applications of the v-adjustment matrix 
produces a vector of integers divisible by p. In the problem statement, the vector v consists 
of p ones and p* — p zeros. 


Solution II by J. H. van Lint, Eindhoven University of Technology, Eindhoven, the Nether- 
lands. Starting with ag, we construct an infinite sequence with aj = a;,,x. Over the field 


k 
F,, we consider the formal power series f(x) = on, ajx' = A(x)/(1—<x? ), where 
k_ Bo 
A(x) = - a;x' is a polynomial of degree less than p*. 
After one adjustment, the terms bo, bi, ... are the coefficients of x?+!, x?+?,... in the 


formal power series for 


ee 
(x dog? Pasay?) f(x) = a? Fe = x(1l — x)P—! F(x). 


1 
The result of n adjustments is the list of coefficients of x"?t), x"(P+D+! in the formal 
power series for 


nj — n(p-1) 
x"(1— xytP—D F(x) — A, 


which is a polynomial of degree less than np ifn(p—1) > p*. Thus the list consists entirely 
of integers divisible by p after n adjustments if n > p*/(p — 1). Noting that 


k k 
—] —2 
LAS a epee ce ee 
p-l p-1 p-\1 
is the least integer greater than or equal to p*/(p — 1), we see that the list consists entirely 
of integers divisible by p after n adjustments if n > (p* — 1)/(p—1) +1. As this is at 


most p*, the desired result follows. 


Editorial comment. David Callan proved that for positive m the list consists entirely of 
integers divisible by p™—! after mp*—! adjustments. In particular, after p* adjustments 
the list consists entirely of integers divisible by p?~!. Another consequence is that the list 
consists entirely of integers divisible by p after 2p*—! adjustments, but this is not as strong 
as the result proved by van Lint. 

Solved also by D. Beckwith, A. E. Caicedo Nifiez (Colombia), D. Callan, R. J. Chapman (U. K.), J. E. Dawson (Australia), 


W. Janous (Austria), K. S. Kedlaya, J. H. Lindsey II, R. Martin (Germany), A. Nijenhuis, J. C. Smith, H.-T. Wee (Singapore), 
GCHQ Problems Group (U. K.), and the proposer. 


Sets with Fixed Nim-Sum 


10564 [1997, 68]. Proposed by Proposed by Aviezri Fraenkel, Weizmann Institute of Science, 
Rehovot, Israel. The Nim-sum of two positive integers with binary expansions )_;..o aj2! 
and 0x9 di 2! is the number with binary expansion ) +0 Ci 2', where aj, b;, c; are in {0, 1} 
and c; = aj +b; mod 2. Letn bea positive integer, and let j be a nonnegative integer. How 
many of the 2” subsets of {1, 2, ..., a} have the property that their elements have Nim-sum 
equal to j? 


Solution by Reiner Martin, Deutsche Bank, London, U. K. Let [n] = {1,2,..., n}, and let 
A denote the symmetric difference operation. Let k = [log,(n + 1)]. There exists a subset 
of [n] whose elements have Nim-sum j only if 0 < j < 2*. We claim that the number of 
such subsets does not depend upon j and thus that this number is 2”~* for each such /. 
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To prove this, let }~;,,9 a2! be the binary expansion of j, and let Aj = {2/: a; = 1}. For 
each A C [n] with Nim-sum 0, let f(A) = AAA;. Note that f(A) has Nim-sum j. Since 
(AAA;)AA; = A, this map is a bijection into the set of subsets of [n] with Nim-sum j. 


Solved also by D. Beckwith, M. Benedicty, D. Berstein, J. C. Binz (Switzerland), M. Bowron, D. Callan, R. J. Chapman (U. K.), 
D. Donini (Italy), G. Gordon, R: Holzsager, K. S. Kedlaya, N. Komada, J. H. Lindsey II, J. Lorch, O. P. Lossers (The Netherlands), 
D. K. Nester, A. Nijenhuis, K. O’Bryant, M.-K. Siu (Hong Kong), J. H. Steelman, W. Stromquist, I. Vardi (Canada), H.-T. Wee 
(Singapore), M. Wolterman, Anchorage Math Solutions Group, GCHQ Problems Group (U. K.), NCCU Problems Group, NSA 
Problems Group, and the proposer. 


Generalized Line Bingo 


10565 [1997, 68]. Proposed by D. M. Bloom, Brooklyn College, Brooklyn, NY, and Kenneth 
Suman, Winona State University, Winona, MN. A rectangle is composed of mn squares 
arranged in m rows and n columns. In acertain game, the squares are selected one by one at 
random (without replacement). What is the expected number of selections until 7 columns 
of the rectangle are composed entirely of selected squares? (When j = 1, m = 5, and 
n = 15, this is the expected length of a type of bingo game known as a line game.) 


Composite solution by the GCHQ Problems Group, Cheltenham, U. K. and the editors. For 
fixed m and n, the required expectation E; equals mn 8 ae mi/(mi + 1). 

For each instance of the game, we can continue selecting squares at random until all 
squares are selected. Thus it suffices to compute, over all permutations of the mn squares, 
the expected length of the initial segment that completes 7 columns. We compute for each 
square x the probability that it belongs to that initial segment. This is independent of x, so 
the expectation is mn times this probability. 

Let A; be the event that x belongs to the initial segment in which i columns are completed; 
note that Pr(A,) = 1. The probability Pr(A;) is the product over i > j of Pr(A;|Aj+1). 

We partition A;+ ; into subevents that fix the trailing segment after the position where the 
(i + 1)st column is completed. In such a subevent S, the identities of the first i + 1 finished 
columns are fixed, but not which of these is last. 

For permutations in S, let B be the set of squares consisting of the first i finished columns 
and the last square that completes the (¢ + 1)st completed column. When x ¢€ B, it is 
equally likely to occupy any of the mi + 1 positions occupied by B, so the fraction of such 
permutations that belong to A; 1s mi/(mi + 1). 

When x ¢ B, we can group the permutations by each fixed permutation of B. Now x is 
equally likely to fall into each of the mi + 1 segments between members of B (or before 
the first). Again the fraction of these permutations that belong to A; is mi/(mi + 1). 


Editorial comment. The rows are unimportant. Victor Hernandez used linearity of expecta- 
tion and the inclusion-exclusion principle to obtain a formula in the more general situation 
where the columns are sets of arbitrary size. 


Solved also by R. J. Chapman (U. K.), D. A. Darling, V. Hernandez (Spain), R. Holzsager, J. H. Lindsey II, P. W. Lindstrom, 
N. C. Singer, J. C. Smith, J. H. Steelman, Anchorage Math Solutions Group, and the proposer. 


Ordered Trees and Stirling Numbers 


10570 [1997, 69]. Proposed by Emeric Deutsch, Polytechnic University, Brooklyn, NY. An 
ordered tree is a rooted tree in which the children of each node form a sequence rather than 
a set. The height of an ordered tree is the number of edges on a path of maximum length 
starting at the root. Let a(n, k) denote the number of ordered trees with n edges and height 
k, and let S(n, k) be the Stirling number of the second kind (the number of partitions of 
{1,2,...,n} into k nonempty parts). Note that a(n, 1) = S(n, 1), since both numbers are 
1. Show that (a) a(n, 2) = S(n, 2), (b) a(n, 3) + a(n, 4) = S(n, 3), and (c)* generalize 
these observations. 
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Solution I by Robin J. Chapman, University of Exeter, Exeter, UK. Let b(n, k) be the number 
of ordered trees with n edges and height at most k. We include the tree with a root and 
no edges, so b(0,k) = 1 fork > O. It suffices to show that b(n, 2) = 1+ S(n, 2) and 
b(n, 4) = 1+ S(m, 2) + S(m, 3) for n > 0. To achieve this, we compute the generating 
function g¢(x) = > 059 b(n, k)x” for0 <k < 4. 

We have go(x) = 1. Fork > 0, an ordered tree of height at most k consists of the root v, 
r edges incident to it, and a sequence of r ordered trees of height at most k — 1 rooted at the 
children of v. The generating function for ordered trees of height at most k in which the root 
has degree r is thus x’ (gx,_1(x))’. Summing over r, we obtain g,(x) = 1/(1 — xgx_1(x)). 
Explicitly, this yields 


1 ] —2x 
gi(x) = ee, 93(x) = 1—3x4 x2" 
1-—x 1—3x+x2 
= —_—_,, d = — 
g2(x) = oe and g4(x) fo 4x 4 3x2 
Expanding by partial fractions yields 
x x x 

See ee ee and = |] + — — 4+ ——_ 
gx(0) = 1+ EO paces T= 


Thus b(n, 2) = 2"—! and b(n, 4) = (3"~! + 1)/2 forn > 1. 

The number of partitions of [1] into at most two parts is half the number of subsets of 
[n], so 1+ S(n, 2) = 2”—!, as desired. Now consider partitions of [m] into at most three 
parts. Each element other than n enters the part with n or one of the other two. Thus 3”~! 
counts each partition with at least two parts twice, as we can interchange the second and 
third part without changing the partition. The partition with one part appears only once, so 
the total number of classes is (3”~! + 1)/2, as desired. 

Finding further relations among these numbers seem unlikely. If fg(x) = fe-1(x) — 
Xfg—2(x) for k > 2, with fo(x) = fi (x) = 1, then g(x) = f(x) /fe41(x) for all k. One 
can show that 


Lk/2] (k—j\ , “2! + it 
x)= —1)/ xi = 1 — 4x cos 
fi (x) yw(*5’) IT ( =) 


j=l 


It follows that 
L(k+1)/2] 


Zz 2) 
b(n, k) = rk, j (4 COS ) 
j=l aie 


for constants r;,;. On the other hand, it is well known that S(n,k) = ei t,,; J” for 


constants t ;. Whenk ¢ {0, 1, 2, 4}, the value 4 cos” j/(k + 2) is irrational for some /, 
so there is little hope of establishing a simple relationship betweeen b(n, k) and the Stirling 
numbers in these cases. 


Solution II to parts (a) and (b) by Daniele Donini, Bertinoro, Italy. We define a bijection 
from the set of ordered trees with n edges and height at most 4 to the set of partitions of [7] 
with at most 3 blocks, in which for k < 3 the trees with height k become partitions with k 
blocks, and the trees with height 4 become partitions with 3 blocks. 

Given a tree 7, label the non-root vertices with the integers 1 through n in order via a 
depth-first left-first search. To form the corresponding partition, let the ith block consists of 
the label on vertices at distance i for the root, except as follows. In each subtree rooted at a 
vertex at distance 3 from the root, put the largest value in block 3 and put the other values 
in block 1. This reduces to partitioning by levels for trees with height 3. 
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For the inverse map, index the blocks in partition 7 in increasing order of their least 
elements: A,, possibly Az, possibly A3. Build the corresponding tree traversal (starting 
with 1 as the leftmost vertex at level 1) as follows. Suppose that the label just processed 
was label k, belonging to A;. Let A; be the block containing label k + 1. 

Case 1: j < i. Make k + 1 anew rightmost leaf at level /. 

Case 2: j =i+ 1. Make k + 1 the first child of the vertex with label k. 

Case 3: j = 3 andi = 1. Because min A? < min A3, there exists a label less than k in A2. 
Let / be the largest such label less than k. Let m be the least label such that all labels from 
m to k lie in Aj; note that m > 1. Remove m,...,k from level 1. Make m the rightmost 
child of I (at level 3). Make m+ 1,...,k + 1 children of m (at level 4). 

Applying the original map to the resulting tree puts each label back into its block in z. 
As an example, the tree corresponding to A; = {1,6,7, 10, 12}, Az = {2,4,9, 11}, and 
A3 = (3,5, 8, 13} 1s shown. 


Solved also by D. Callan, R. Holzsager, the Anchorage Math Solutions Group, and the proposer. 


Lattice Points Inside a Triangle 


10600 [1997, 566]. Proposed by Franz Rothe, University of North Carolina, Charlotte, NC. 
(a) Suppose a triangle has its vertices at integer lattice points in the plane and contains 
exactly 3 integer lattice points in its interior. Show that the center of mass of the triangle is 
not an integer lattice point. 

(b)* Find all values i such that, if a triangle has its vertices at integer lattice points in the 
plane and contains exactly i integer lattice points in its interior, then the center of mass of 
the triangle cannot be an integer lattice point. 


Solution of part (a) by Robin J. Chapman, University of Exeter, Exeter, U. K. Let the vertices 
of the triangle be A, B, and C, with position vectors a, b, and c, respectively. Suppose that 
the centroid of the triangle has integer coordinates. This centroid is (1/3)(a+b+c). Let 
D denote twice the area of the triangle. Then 


D = |(b—a) x (c—a)| = |(c—b) x (a—b)| = |(a—c) x (b—c)| = lax b+bxc+cexal 


where x denotes the vector product. By assumption (1/3)(a+b-+c) has integer coordinates; 
therefore, so has (1/3)(a + b + c) — a = (1/3)((b — a) + (c — a)). Hence (c — a) = 
—(b — a) + 3d, where d has integer coordinates, and so D = |(b — a) x (ec — a)| = 
3|(b—a) x d| is amultiple of 3. Letr,s, and t be the largest integers such that (1/r)(b—c), 
(1/s)(c — a), and (1/t)(a — b) have integer coordinates. Then the interiors of the sides 
BC, CA, and AB contain respectively r — 1, s — 1, and t — 1 lattice points. By Pick’s 
Theorem D = 2N, + N2 — 2, where N is the number of lattice points in the interior of 
the triangle and N2 is the number of lattice points on its boundary (including the vertices). 
Consequently D=4+r+s+t. Also 


b-—a c-—a 


D=st x 


? 


AY 


and so D is divisible by st. Similarly, D is divisible by rs and rt. 
Since D is divisible by 3, we have r + s +t = 2 (mod 3). We now show that none of 
r, S, t is divisible by 3. Suppose that r is divisible by 3. Thenb—c = (b—a)+(a—c) = 3e 
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for some vector e with integer coordinates. But we already know that (b—a)—(a—c) = 3d. 

Hence both b — a and a — ¢ are integer vectors multiplied by 3. Thus s and ¢ are divisible 

by 3, contradicting r-+s+t =2 (mod 3). It follows that one of r, s, or t is congruent to 1 

modulo 3, and the others are congruent to 2. Also D must be divisible by 3rs, 3st, and 3rt. 
We may assume thatr > s >t. Now 3rs < D=4+r+4+5s +t, and so 


(3t — 1)* < Br — 1)(3s — 1) = Ors — 3r —35 +1 < 1343t¢. 


This inequality is false fort > 2, sot = 1. Therefore r = 2 (mod 3) and also 


s= 
(3r — 1)(3s — 1) < 16. Together, these imply that r = s = 2, and so D = 9. Now D is not 
divisible by 3rs, so we have a contradiction. 


Editorial comment. The proposer discovered the following result: Let a triangle have vertices 
at integer lattice points (0,0), (b1, b2), and (c2,c2). Let a = gced(bj — c2, bz — cp), 
B = gcd(bo, b2), and y = gcd(cj, cz). The center of mass 1s a lattice point if and only if 
either (i) 3 is a divisor of all three numbers a, f, and y; or (ii) 3 is a divisor of none of the 
three numbers a, f, and y, but 3 is a divisor of the double area D = bicz — bc. 

Only partial solutions were received for part (b). Searches by John H. Lindsey II and 
by the GCHQ Problems Group found the values i < 1000 satisfying the condition. The 
two lists are the same, except that 906 appears in one list and not the other. The remaining 
values found were: 3, 6, 15, 18, 30, 36, 48, 51, 63, 78, 90, 108, 120, 138, 150, 156, 168, 
210, 228, 270, 300, 303, 336, 360, 378, 408, 426, 438, 480, 510, 528, 531, 630, 660, 723, 
738, 750, 780, 888, 930, 990, 996. 


Part (a) also solved by J. H. Lindsey II, GCHQ Problems Group (U. K.), and the proposer. 


A Surrounded Set 


10608 [1997, 664]. Proposed by Victor Zalgaller, Steklov Mathematical Institute, St. Pe- 
tersberg, Russia. Let S be a compact convex set in the plane. If / is any line of support for 
S, let fl) be the length of the shortest curve that begins and ends on / and that together 
with / surrounds S. Prove that if f (J) is independent of /, then S is a circle. 


Solution by John Arkinstall, Monash University, Australia. Let l’ be the support line parallel 
to 1 on the opposite side of S. Since f(J) + f(’/) is independent of | but also equals the 
perimeter of S plus twice the width of S perpendicular to /, S is a set of constant width w. 
A line 1” parallel to 1 and I’ whose intersection with S is of maximum length is called a 
diameter of S in the direction of /. Because S has constant width w, we may conclude that 
the length of such a diameter is w, that it joins two points where support lines perpendicular 
to J touch S, that these two support lines, together with / and J’, form a square of side w, 
and that each support line touches S in a unique point. A theorem of Khassa (Relation 
between maximal chords and symmetry for convex sets, J. London Math. Soc. 15 (1977) 
541-546) states that a convex curve of constant width in which the diameter in each direction 
is midway between the two support lines in that direction must be a circle. Thus it suffices 
to prove this “midway” property. 

When S is a set in the plane of constant width w, 2A(/) — wP(J) is independent of 
1, where A(J) is the area of the portion of S between the opposite support line 1’ and the 
diameter /”, and P(/) is the length of that part of the perimeter of S on the same side of 1” 
as I’ (L. Beretta & A. Maxia, “Insiemi convessi e orbiformi,” Univ. Roma e Ist. Naz. Alta 
Mat. Rend. Mat. (5) 1 (1940) 1-64). Let r(/) denote the distance from / to the diameter 
in the direction of J. Since r(J) is the length of the supporting line segment from / to the 
boundary of S on the shortest curve from / surrounding S, we have f(/) = 2r(/) + P(J). 
Since this is independent of | by hypothesis, so is AV) + wr(J). This is the area of the 
convex hull of S and the two supporting line segments from / on the shortest curve from /. 
The complement R of this convex hull in the supporting square to S with side length w and 
edge along / therefore also has area independent of the direction of /. 

/ 
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Now consider how the area of R changes when the direction of / is changed by a small 
angle @. If the point of support on I’ divides its side of the supporting square in the ratio 
u: w — u, then the area of R changes by four small approximately triangular regions: It 
decreases by (1/2)r(l’)@ + o(@), increases by (1/2)u@ + o(f), decreases by (1/2)(w — 
u)d + o(d), and increases by (1/2)r(l’)¢ + o(@). Thus, the area of R changes by the sum 
(1/2)(Qu — w)d + o(@). Since this is 0, we have 2u — w = 0, and thus the support point 
on l’ is midway between the two support lines perpendicular to 1’. 


Solved also by S. S. Kim (Korea), J. G. Merickel, GCHQ Problems Group (U. K.), and the proposer. 


Tight Bounds for the Normal Distribution 


10611 [1997, 665]. Proposed by Zoltan Sasvari, Technical University of Dresden, Dresden, 
Germany. Find the largest value of a and the smallest value of b for which the inequalities 


1+V1—e-4 1+V1— e756 
2 2 


hold for all x > 0, where P(x) = 


P(x) 
L x  —y2/2 

Von f50 e / dy. 

Solution by Hongwei Chen, Christopher Newport University, Newport News, VA. We show 

that a = 1/2 and b = 2/7 are the best possible constants for which the stated inequalities 

hold. Since { os eV /2 dy = foe ‘2 dy = Ja /2, the stated inequalities are equivalent 


to 
V1 —e74? V1 — ea bx? 
Se (1) 
2 2 
where f(x) = io ie eW¥'/2 dy. If the second inequality of (1) holds for all x > 0, then 


O< 


V1 —e7bx? Jb 1 3 


as x — O, which implies b > 2/z. Similarly, if the first inequality of (1) holds for all 
x > 0, then 


J1—e7ax2 —x?/2 —x? /2 
O< f (x) = Nee = co +0 ee) = é 410 ee 


Z Qnx se 


as x — oo. Dividing each side by en" /2 yields a < 1/2. 
To show that inequalities (1) hold for all x > O when a = 1/2 and b = 2/z, we write 


1 XxX XxX 
(FO)? = = | | oP)? aydz. 
0 0 


Let D = [0, x]*, Di = {(y,z):0<y, 0<z, y> +27 < x7}, and D2 = {(y,2):0<y, 
O<z, y*+2z* < (4/m)x*}. We have the inequalities 


i I OG yp ea I e422 aye cL I e422 aydz, (2) 
2m JID, 2n JJp 2m JJ D>» 


the first because D; Cc D, and the second because D and D>» have the same area and 
eV 2°/2 < e- Q2/m)x" fog (y,z) € D — D2 while eV #27)/2 > g-(2/m)x" fog (y,z) € 
Dz — D. Evaluating the outer integrals in (2) in polar coordinates, we obtain 


which is equivalent to (1). 


Solved also by P. Alsholm (Denmark), J. Anglesio (France), P. Bracken (Canada), B. Burdick, G. G. Chappell, P. Devaraj (India), 
G. Keselman, K.-W. Lau (Hong Kong), J. H. Lindsey II, A. Stadler, GCHQ Problems Group (U. K.), NCCU Problems Group, 
NSA Problems Group, WMC Problems Group, and the proposer. 
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REVIEWS 


Edited by Harold P. Boas 
Mathematics Department, Texas A & M University, College Station, TX 77843 


Mathematics: From the Birth of Numbers. By Jan Gullberg, W. W. Norton, 1997, 
xxili + 1093 pp., $50. 


Reviewed by Arnold Allen 


Gullberg’s book has worthy competitors for the title The People’s Guide to Mathe- 
matics. In fact, at this time there is a veritable cornucopia of excellent mathematics 
books for the general reader. Ones that I particularly like are those by Courant 
and Robbins (revised by Stewart) [3], Devlin [4, 5], Dunham [6], Hildebrandt and 
Tromba [8], Jacobs [9], and Stewart [12, 13] for surveys of mathematics. For 
enlightenment in more specialized areas, I like Casti [1], Conway and Guy [2], 
K6rner [10], and Vilenkin [14]. Gullberg’s book is clearly the overall winner. This 
book will appeal to a range of MONTHLY readers as well, from undergraduate math 
majors to instructors. It is a wonderful read. I take it with me everywhere I go: to 
the dentist’s office, waiting in the doctor’s office, to boring meetings—wherever I 
have an opportunity to read. 
The book lived up to its description on the dust jacket, which included: 


This gently guided, profusely illustrated Grand Tour of the world of mathematics takes the 
reader on a long and fascinating journey—from the dual invention of numbers and language, 
through the primary realms of arithmetic, algebra, geometry, trigonometry, and calculus, to the 
final destination of differential equations, with excursions into symbolic logic, set theory, 
topology, fractals, probability, and assorted other mathematical byways. ...The text is inter- 
spersed with more than 1,000 original, high-quality technical illustrations, a multitude of 
reproductions from mathematical classics and other relevant works, and a generous sprinkling of 
humorous asides ranging from limericks and tall stories to cartoons and decorative drawings. 


The sentence following the ellipsis explains some of the special charm of the book. 
Gullberg has made even more innovative use of the margin than Graham, Knuth, 
and Patashnik [7] did with their irreverent student “graffiti.” Gullberg has excellent 
quotes and quips throughout the book, including the margin, but he also uses the 
margin for noting things mentioned in the preceding paragraph as well as for 
pointers to related topics, biographies of famous mathematicians, labels concern- 
ing the nearby text, photographs, and other uses too numerous to catalog. 

Let me describe my ramblings in this remarkable book. The first chapter 
discusses the birth of numbers or “Where did numbers come from, anyway, and 
what are their properties?” This chapter is followed by two excellent chapters that 
describe systems of enumeration and types of numbers. The next chapter, 
“Cornerstones of Mathematics,” is my favorite chapter and will appeal to begin- 
ners and old hands as well. I was amazed to learn in this chapter that the 
pencil-and-paper multiplication algorithm we learned in grade school is not the 
only such multiplication algorithm. Gullberg demonstrates “the Egyptian method 
of duplation” algorithm for multiplication. (He also displays a multiplication table 
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from the Bomberger Rechenbuch of 1483 which indicates that 7 x 3 = 12. Murphy’s 
law had already been passed!) I loved his example of the “Russian peasant” 
method of “multiplication by successive duplation and mediation.” I learned a 
dividing and averaging method, originating with the ancient Babylonians, for 
finding the square root of a number; it is efficient, too. I learned how to 
approximate ¥6 by a continued fraction and how to convert 221/41 to the 


continued fraction 


lS — 
De 


1+ 


34+- 


I saw Pascal’s triangle as Pascal drew it. Gullberg shows it as found in Pascal’s book 
Traité du Triangle Arithmétique, Paris, 1665. We see it in another form from a book 
written in 1303 by the Chinese mathematician Chu Shih-chieh. Pascal was unaware 
of this earlier work and drew his triangle very differently from the way we now 
draw it. I learned how to use an abacus for multiplication and division. The 
decorative drawings in Figure 1 appear in the margin of the book next to the last 
four instructions for dividing 377 by 26 with the abacus. 


O, 
es 
8 4d 
sh _) 


a0 Y 


PPYVELEA. 


[Did you understand it?] 
Kotae wa onaji deshita ka? 
[Did you get the same answer? ] 


Wakarimashita ka? 


[Finished!] 


a Y 
Owari! 


Figure 1 


I was delighted by Gullberg’s clear and brief chapter on combinatorics. In it he 
discusses several interesting combinatorial problems and their solutions. I espe- 
cially liked the section on graph theory and the section on magic squares and their 
kin. In the former, he discusses the famous seven bridges of KOnigsberg problem. 
He even provides a map of Konigsberg circa 1740! He describes Euler’s solution of 
the problem as well as the history of the related four-color map problem. Gullberg 
describes and displays several famous magic squares, such as Durer’s remarkable 
order 4 magic square, constructed in 1514, and Benjamin Franklin’s order 8 magic 
Square with some unbelievable properties. Yes, the Benjamin Franklin. Franklin’s 
magic square shown in Figure 2 appears in the margin of Gullberg’s book. 

Fibonacci’s sequence and related sequences are well treated in the chapter on 
sequences and series. Immediately after his statement of Fibonacci’s original 


78 REVIEWS [January 


FRANKLIN Benjamin 
(1706-1790) 
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Franklin’s Magic Square 


Figure 2 


problem, Gullberg provides an amusing drawing of a cluster of rabbits. Then, in a 
magical two pages, he provides a great deal of interesting information about the 
Fibonacci (F,,), Lucas (L,,), and Pell (P,,) sequences, including a discussion of the 
golden ratio and the Binet formulas for F, and L,. Another peak experience for 
me in this chapter was reading the section on figurate numbers. Using graphic 
software to produce displays representing numbers by dots, Gullberg develops the 
properties of triangular, oblong, and pentagonal numbers in the plane. Then, using 
three-dimensional displays of dots, he derives the properties of cubic, tetrahedral, 
and square pyramidal numbers. Finally, he demonstrates how to extend tetrahe- 
dral numbers to superhedral numbers. An enjoyable chapter! 

Fermat’s last theorem has been in the news, but I found it difficult to explain to 
friends. Now I refer them to Gullberg’s two-page discussion; it is at exactly the 
right level for those who are mathematically disadvantaged, but with some sophisti- 
cation as well. Friends do not have to endure a discussion of elliptic curves and the 
Taniyama-Shimura conjecture as a journalism-educated friend of mine did when 
she bravely attended a math meeting with me. 

The chapter “Theory of Equations” that contains the material on Fermat’s last 
theorem also demonstrates, with worked examples, how to solve almost any kind of 
equation: quadratic, cubic, quartic, Diophantine; systems of linear equations, 
systems of linear and quadratic equations; and others. A pleasure to read! 

I enjoyed Gullberg’s brief but fascinating “Overture to the Geometries.” It has 
the history and a quick sketch of the various branches. There are nice pictures, 
illustrations, and diagrams, including some elegant drawings of knots and other 
topological entities. I also liked the five chapters on geometry that follow, espe- 
cially the section in the trigonometry chapter on solving triangles. Gullberg shows 
how to keep difficult geometry problems from becoming boring; he solves them 
with élan. 

The most complete coverage of any subject in the book is given to mathematical 
analysis. There are chapters on differential and integral calculus and power series; 
detailed chapters on the major applications of calculus; a chapter on harmonic 
analysis with Fourier series; a chapter on methods of approximation; and an 
excellent introductory chapter on differential equations. Each chapter features 
theory as well as the solution of non-trivial problems. 

I loved the probability chapter. There is a wonderful section on the history of 
probability with the names and achievements of most of the major players. 
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Gullberg also provides an excellent survey, with examples, of basic probability and 
Statistics. He describes the basic distributions, such as normal and binomial, with 
their distribution functions and examples of how they are used. There is a 
photograph of a mustached gambler and his moll that looks suspiciously like 
Gullberg and his wife. 

I bought my copy of Gullberg’s book after leafing through it at a bookstore. It 
impressed me so much that I showed it to my colleagues at the office (I am part of 
a small R & D group). Everyone wanted to borrow it! I had to hide it in my desk to 
keep it from disappearing. One colleague got his wife a copy for her birthday—she 
is a high school math teacher. She loved it and showed it to their son’s math 
teacher who is now an enthusiastic owner, too. To improve mathematics education 
in the USA we might begin by providing every high school math teacher with a 
copy of the book. 

Later I learned that the large first printing was sold out almost immediately. 
Then I discovered that the Book-of-The-Month-Club had chosen it as one of its 
selections, and that the MAA has made it available to its members. The book has 
had the success that an author might dream of on writing a first book. Sometimes 
there is a bit of luck involved in becoming a best-selling author, perhaps an 
invitation to be a guest on Oprah Winfrey’s show. But the success of Gullberg’s 
book is not due to luck, or to Oprah: this really is a wonderful book! The dust 
jacket contains rave reviews from Martin Gardner, Philip Morrison, and Harold 
Jacobs. In addition, Peter Hilton wrote a forward ‘Mathematics in Our Culture.” 

What is the appeal of this book to the general reader? I could use a real estate 
analogy and say “Coverage, coverage, coverage.” The book has excellent chapters 
on the standard material that most MONTHLY readers—as well as many engineers, 
scientists, and general readers (the “two cultures folks”)—have studied some- 
where before but would like to review. In addition, there are special chapters or 
sections of chapters on subjects that many of us have never seen before—such as 
the first several chapters that cover the birth of numbers and their applications; 
two algorithms in Chapter 4 for finding cube roots (Gullberg says, ““We have found 
the good physician Trenchant’s method an excellent cure for insomnia”) as well as 
several methods for approximating cube roots; the section on the abacus, the slide 
rule, Napier’s bones, and the quipu; and the incredible Chapter 11, “Overture to 
the Geometries,” with much interesting material known only to specialists. 

J am surrounded by engineers who do not use mathematics on a daily basis but 
who occasionally have special problems that need some mathematical analysis. 
Since I am the only one around who has a Ph.D. in mathematics, they sometimes 
come to me to ask basic mathematical questions. Now I can just refer them to the 
Gullberg book for standard material. I believe it will become a general reference 
for many MONTHLY readers. There are other uses of the book for MONTHLY 
readers. The possibilities for undergraduate student readers include: 


¢ Reading the book to broaden their understanding of mathematics beyond 
their course work. 

e Using it as a supplement to many of their courses, such as calculus, college 
algebra, geometry, and statistics. 


The possibilities for instructors include the following: 


e Using it as a textbook for courses such as Mathematics for Poets or Calculus 
for Poets. An experiment like this is being tried by Dennis Watson and his 
colleagues at Clark College in Vancouver, Washington. 
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e Using it as a supplementary textbook. 
e Using it for extra credit research study and reports. 


As an author, I must note possible uses of the book for authors of undergraduate 
textbooks. These include: 


e Saving time and money for special figures or material by referring their 
readers to the book, rather than including the special material in their books. 
For example, I plan to refer readers of the book I’m writing to Gullberg’s book 
for the history of probability and to show them what Pascal’s triangle looked 
like to Pascal as well as to ancient Chinese mathematicians. This will be better 
than writing a history myself or getting the necessary permissions to make 
copies of the Pascal triangles. 

¢ Using it as a source of ideas for material to discuss and for examples to 
emulate. 


Gullberg’s book is a gigantic book in every sense of the word. It weighs four 
pounds and thirteen ounces and has 1100 pages. (While reading this book is most 
enjoyable, it gives a whole new meaning to the phrase “heavy reading.”) It took a 
giant effort to produce such a masterpiece. Gullberg spent more than ten years 
writing it, mostly at night after a full day as an active physician and surgeon. 
However, he got a lot of help, especially from his family. His son persuaded him to 
write it, provided most of the mathematical graphics, helped him keep his com- 
puter running, and counseled him about content. His wife, Ann, proofread the 
manuscript and drew many of the illustrations. Some others were done by their 
twin sons, Kamen and Kalin, when they were nine or ten years old. Gullberg also 
has talented friends in various disciplines who reviewed what he was doing, and he 
got some help from outside professionals. As he says in the preface, “My draft text 
was later polished with the help of distinguished professional mathematicians, 
linguists, and historians.” 

Although he got others to help him with the writing of the book, he did the 
actual writing using a Macintosh Plus and Microsoft Word 3.0. These were 
state-of-the-art when he began his writing, but are ancient relics now. It was a most 
challenging task to keep the Macintosh Plus going all those years and to hack the 
equations without LaTeX or a commercial software package. However, he did it. 
As he notes on the copyright page: “Camera-ready copy for this book was 
produced entirely by the author, utilizing a combination of modern desktop-pub- 
lishing and traditional paste-up methods.” This was an incredible achievement. 

Peter Renz [11] describes some of the problems Gullberg had with his computer 
and how and why the book was written. Renz also gives a biographical sketch of 
Gullberg and some details of the production and publication of the book. 

These are my three wishes for an encore from Dr. Jan Gullberg: 


¢ Set up a web site so that we can have two-way communication with him. For 
example, instructors who are using his book could let him know why they 
decided to use it, and Dr. Gullberg could make this information available to 
others. Everyone could suggest new exercises, or potential improvements, or 
corrections, and he could post additional material, corrections, etc. 

e¢ Write a solutions manual for the exercises or commission someone to do so. 

¢ Write a second volume containing material he hadn’t time or room enough to 
put in the present book, such as Green’s theorem, wavelets, encryption 
techniques such as trap door ciphers—the kind of material that is described 
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in Barry Cipra’s series for the American Mathematical Society on what’s 
happening in the mathematical sciences. 


Jan Gullberg’s book is a giant leap forward for mathematics and all those who 
love it! 
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EDITOR’S CORNER. I am a mathematician today because of an experience in 
kindergarten. I have always been interested in analyzing patterns, and from an 
early age I was excited by all sorts of intellectual games having a strong element of 
pattern recognition: chess, go, Scrabble, crossword puzzles, and so forth. The first 
really sophisticated pattern-recognition task that young children face is learning to 
read, and I was starting to read at home before I began formal schooling. 
However, because reading was on the syllabus for first grade, my kindergarten 
teacher viewed reading as a proscribed activity and successfully quashed my initial 
enthusiasm for the printed word. Consequently, I redirected my energies toward 
arithmetic, which the teacher apparently did not recognize as a subversive subject. 
The result was that later on, I was always a year ahead of my schoolmates in 
mathematics, and so I ended up a mathematics major in college. Thus, it is thanks 
to my kindergarten teacher that I am today a specialist on the theory of the 
Bergman kernel function rather than on, say, the works of Jane Austen. 

There is a moral in this story for the public school system, but I should like to 
address instead what this story suggests about how to teach mathematics effec- 
tively. It is a familiar metaphor that mathematics is a language, and in my early 
experience mathematics and language were both difficult—and therefore interest- 
ing—for the same reason. Both subjects to me were games presenting an artificial 
universe operating by well defined, albeit arbitrary, rules. The goal was to under- 
stand the rules and their consequences so thoroughly that they became internal- 
ized and automatic, in which case one could justifiably claim to have solved the 
game. It is in this sense that a sixth-grader who can unscramble Rubik’s cube 
rapidly while blindfolded can assert mastery of that game. 

Now if mathematics is a language, why are the standard methods of teaching 
mathematics not even remotely similar to the standard methods of teaching 
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languages? The way to become fluent in a language is total immersion. Has anyone 
tried to foster mathematical ability by the same method? Actually, yes: intensive 
summer programs for teenagers, such as the famous one developed at Ohio State 
by Arnold Ross, have indeed proved effective. However, the standard mathematics 
program in a typical American school is more akin to wading than to immersion. If 
our children spend one-fourth the time on mathematics that they spend watching 
television, is it any wonder that they fared poorly in the recent Third International 
Mathematics and Science Study? (See http: // wwwcsteep.bc.edu/ timss for 
detailed information about TIMSS.) 

Children seem automatically to soak up their native language. A foreign 
language is a better comparison to mathematics. In French class, for example, 
students get practice in speaking French, reading French, and writing French; drill 
on French vocabulary and grammar; and exposure to French culture and history. 
Compare this with mathematics instruction. It is an uncommon mathematics class 
that pays much attention to mathematical culture and history. Although the 
so-called “reform” movement supports giving students practice on reading and 
writing mathematics, this idea is spreading slowly. If you have not tried the 
experiment, assigning your mathematics class a writing task will be an eye-opening 
experience. At the beginning of the semester, I assigned a class of engineering 
students to write a short essay interpreting part of the introductory chapter in their 
differential equations textbook. Ten percent of the students immediately dropped 
the course. I was chagrined to discover that of those students who completed the 
assignment, about one-third had not understood the book’s discussion of implicit 
solutions versus explicit solutions because they did not know the meaning of the 
word “implicit.” It is a struggle to teach students in a way that is different from the 
way they were taught in grade school and high school. 

Please note that I am not complaining that students are getting worse and that 
the world is going to the dogs. (This is true, and has been true throughout 
recorded history, but that is another story.) Actually, teaching college mathematics 
is a rewarding experience if one takes advantage of the skills that modern students 
have. My students cannot do arithmetic rapidly and may not comprehend words 
derived from Latin roots, but they have well developed social skills, know how to 
work collaboratively in groups, can give coherent oral presentations, and are highly 
creative if given a little encouragement. I fondly remember one response to an 
examination question I posed requiring the students to invent a scenario involving 
a swimming pool for which a given differential equation would be an appropriate 
model. The solution I expected was a mixing problem concerning the concentra- 
tion of chlorine, but one student’s (mostly correct) answer began: “Kim drops a 
calculator into a swimming pool from a height of 100 meters... .” It is not the case 
that today’s students cannot think, but it is true that their intellectual and cultural 
backgrounds are different from those of their teachers. My point is that, like 
generals who prepare to fight the previous war, textbook expositors tend to write 
prose for the previous generation. 

In a standard mathematics book, what gets taught is primarily mathematical 
vocabulary and rules of mathematical grammar. Perhaps in advanced undergradu- 
ate classes, students may begin to learn some principles of mathematical composi- 
tion, but it is only in graduate school that one gets exposed to the mathematical 
analogues of fine literature and poetry. | | 

It would be wonderful to have some textbooks that go beyond mathematical 
vocabulary and grammar to incorporate the mathematical equivalents of literature, 
poetry, culture, and history in a way that students can understand and appreciate. 
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Reading Arnold Allen’s review of Jan Gullberg’s Mathematics: From the Birth of 
Numbers made me think that it might be such a book, so I obtained a copy and 
read it. My impressions of the book are those of a university professor of 
mathematics active in both research and teaching. 

The first impression one gets from Gullberg’s massive tome is that producing it 
was a tremendous effort. Gullberg is a remarkable person: not a professional 
mathematician, he had the motivation, enthusiasm, and dedication to spend a 
decade learning about mathematics and recording his findings. His book is an epic 
written by an amateur. 

It is not surprising that the best parts of the book are the ones that do not 
involve advanced mathematics. An amateur has the opportunity for originality of 
presentation when discussing the history of mathematics and those topics generally 
known as “recreational mathematics.” It is here that Gullberg shines. Even 
professionals will find some amusing tidbits in the book. For example, I did not 
know that the circumcenter, centroid, and orthocenter of a triangle are on the 
same line, and reading this statement in Chapter 12 sent me to the library to look 
for a geometry book with proofs in it. 

The second half of the book, where Gullberg tackles calculus and other parts of 
the university mathematics curriculum, disappointed me. Presumably Gullberg 
learned his calculus from a standard textbook, and his sections on calculus read 
like a distillate of any traditional calculus book. In Gullberg’s encyclopedic work, 
there is little space for discursive exposition, and the impression that comes across 
is that mathematics consists of rules, formulas, and algorithms. If an educated 
layperson with extraordinary motivation has this concept of mathematics after 
studying our textbooks, then we writers of mathematics books have failed miser- 
ably. 

In any book of over 2'° pages, there are mistakes. I found fewer than expected. 
There are some typographical errors, such as on page 570 where the eccentricity of 
an ellipse is given as e < 0 instead of « < 1. Then there are some mathematical 
misconceptions, such as the confused statement and proof of Cauchy’s mean-value 
theorem on pages 718-719, where Gullberg demands that the two functions have 
matching values at the endpoints (which makes the theorem rather trivial). I found 
only one true howler: on page 735, Gullberg comes unstuck calculating the 
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indefinite integral of the function f defined by f(x) = 1/(/x + Vx) and obtains a 
wrong answer that is even singular at x = 1. 

Gullberg is writing at the limit of his knowledge, and it shows. Every college 
mathematics teacher who reads this book will wince at some statements that are 
not exactly wrong, but that will certainly mislead naive readers. Here are some 
examples—admittedly taken out of context—of statements that make me queasy. 


“with the assistance of computers, we can easily obtain approximate values of 

virtually any convergent series” (page 282) 

“points on the graph of a function may approach successively nearer to a 

straight line, the asymptote” (page 341) 

¢ “topology is a non-metric geometry” (page 369) 

e “Unlike matrices, which may be of any rectangular shape, determinants are 
always square.” (page 646) 

e “ji! =e 7/*” (page 793) 

“A differentiable function of one independent variable has a maximum or a 

minimum where its first derivative is zero.” (page 816) 
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Exercise. Why do I object to each of these statements? 

Many readers will find Gullberg’s book valuable, despite its imperfections. 
However, it is not the book I hoped for. Although it has history and culture, and 
lots of vocabulary and grammar, it does not get to literature and poetry; nor does it 
give any inkling that most of the mathematics that exists was created in the 
twentieth century. I would not dare to give this book to my niece, for it would only 
confirm her belief that mathematics is just boring formulas, but I would have been 
a grateful recipient if someone had given me this book when I was a high school 
freshman. It would have opened my eyes to what an extensive game mathematics 
is, and I would have had fun playing with the formulas and trying to figure out why ~ 
they work. 

Gullberg produced camera-ready copy for this book himself, which was a 
tremendous job. I wonder why he did not enlist the aid of a professional typesetter, 
who could have warned him against solecisms such as increasing the inter-letter 
spacing of a widow word on the last line of a paragraph. The typesetting is a 
metaphor for the mathematics in the book: it is serviceable, but it does not look 
quite right to an expert eye. 

I continue to wait for someone to write a book that treats mathematics as a 
foreign language. I want to give a copy to my kindergarten teacher. 


—RHarold P. Boas 


Handbook of Applied Cryptography. By Alfred J. Menezes, Paul C. van Oorschot, and 


Scott A. Vanstone. CRC Press, Boca Raton, 1997, 780 + xxviii, $79.95. 
The Cryptographic Imagination: Secret Writing from Edgar Poe to the Internet. By Shawn 
James Rosenheim. Johns Hopkins University Press, Baltimore, 1997, 264, $47.50. 


Reviewed by Jeffrey Shallit 


Cryptography is the art and science of writing and reading concealed messages. 
David Kahn, in his monumental work, The Codebreakers |9], traces its origin to 
1900 B.C.E., in an inscription carved on the tomb of Khnumhotep II. Not 
surprisingly, modern cryptography has a strong mathematical component. Indeed, 
just weeks before the Japanese attack on Pearl Harbor in World War II—a war 
where cryptography would play a crucial role—algebraist A. A. Albert stated in a 
regional meeting of the American Mathematical Society, “It would not be an 
exaggeration to state that abstract cryptography is identical with abstract mathe- 
matics.” [1] 

As we make the transition to a wired society, cryptography is becoming more 
and more fundamental. Cryptography can be used to preserve the privacy of 
messages exchanged over the Internet, to provide a means for electronically 
signing messages (so that recipients can be assured the message really came from 
the stated author), to prevent unauthorized access to sensitive information, and to 
provide a secure system for “electronic cash.” Of course, like any useful tool, it can 
also be used to conceal evidence of lawbreaking, which is one reason why the 
debate over cryptography policy is so heated [8]. The United States government 
continues to restrict export of certain kinds of cryptographic software, although 
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lately there have been some signs of weakening of this policy. As I write this, 
Senators Conrad Burns (R MT) and Patrick Leahy (D VT) have introduced the 
“Promotion of Commerce Online in the Digital Era (Pro-CODE) Act,” a bill 
designed to relax government controls on encryption. 

Although some (notably Dorothy Denning [5]) continue to endorse some kinds 
of government control over cryptography, most observers concede that the genie is 
out of the bottle, and no law can stuff it back in. Some of the mathematics involved 
in cryptography is so elementary that a bright high-school student can devise 
encryption software essentially uncrackable with currently available techniques. 

Modern cryptography has traveled a long way from the simple substitution 
cipher, where messages are encoded by applying a permutation of the letters 
{A, B,C,..., Z}. As a glance at the proceedings of the latest conferences on the 
subject (CRYPTO, EUROCRYPT, AUSCRYPT, ASIACRYPT) demonstrates, 
modern cryptography is based largely on number theory, combinatorics, and 
algebra. Although some cryptographic techniques are quite elementary, others 
involve the deepest and most modern aspects of these areas. 

The so-called public-key cryptosystem is a fundamental discovery of modern 
cryptography. In older methods based on secret keys, the same key is used to 
encode and decode a message. Losing the key means that the secrecy of the 
message is compromised. In 1976, however, Whit Diffie and Martin Hellman 
invented a new kind of cryptosystem in which the encryption and decryption keys 
differ [6]. To employ the system, a user U publishes the encryption key e for all to 
see. By using the key, anyone can send an encoded message to U. But U keeps the 
decryption key d a secret, and the system is designed so that obtaining d from e is 
a problem that appears to be computationally intractable. In practice, then, 
without enormous computing resources or extraordinary luck, only U can read the 
message because only U knows the decryption key d. 

The RSA scheme—the name comes from the initials of its inventors, Rivest, 
Shamir, and Adleman—is the most famous example of a public-key cryptosystem 
[11]. It is based on the apparent intractability of computing kth roots modulo 
composite numbers. Invented in 1977, the system continues to withstand the 
various attacks devised against it, although research has provided some caveats 
about how best to choose the particular parameters involved. More recent propos- 
als are based on number theory, and draw their inspiration from the discrete 
logarithm problem, the quadratic residuacity problem, and the theory of elliptic 
curves. 

Another advance of modern cryptography is the digital signature. Here the goal 
is to attach a number to a message depending on (a) some secret key known only 
to the signer and (b) the content of the message itself. A good digital signature 
scheme allows the user to verify a message’s authenticity and prevents later 
repudiation of signatures. Secure digital signatures are essential if electronic 
commerce is to become a reality. 

In their Handbook of Applied Cryptography, Menezes, van Oorschot, and Van- 
stone convincingly demonstrate the serious mathematical content of applied cryp- 
tography. The authors have significant expertise in both theoretical and practical 
aspects of cryptography. Vanstone, for example, is a founder of a corporation that 
makes encryption products, and is the author of at least two dozen papers in the 
literature on cryptographic topics. Menezes wrote his Ph.D. dissertation on public- 
key cryptography [10], and van Oorschot is employed as a cryptographer. 

The approach taken in this book is encyclopedic. Nearly every aspect of modern 
cryptography—stream ciphers, block ciphers, public-key cryptography, hash func- 
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tions, digital signatures, key establishment protocols, etc.—is covered in its 
800 pages. Each chapter contains a “Notes” section, with valuable historical 
remarks. The book contains over 1200 citations to the literature. 

The Handbook strongly emphasizes practical aspects of cryptography. For 
example, Chapter 14 discusses how to perform extended-precision arithmetic 
efficiently (essential for modern cryptography, which routinely deals with numbers 
of 100 to 400 digits). 

The Handbook probably isn’t appropriate as a textbook or gentle introduction 
to cryptography—for that, see instead [3] or [12]. It is probably more appropriate 
as a reference work, and in that it succeeds admirably. There are, however, some 
minor flaws. For example, the surname of Franz Mertens is misspelled twice. More 
seriously, Definition 2.73 erroneously states that “A problem is NP-hard if the 
existence of a polynomial-time algorithm for its solution implies that P = NP.” 
While this is correct if, as most suspect, P # NP, it is incorrect if P = NP. The 
correct definition is that a problem X is NP-hard if every problem in NP reduces 
to X in polynomial time. Similarly, Definition 3.1, which defines polynomial-time 
reduction, is regrettably imprecise, drawing no distinction between Turing reduc- 
tion and many-one reduction. Other errors can be found in the world-wide-web 
page for the book, http: // www.dms.auburn.edu/ hac /. 

Another annoyance is that the index is inadequate. For example, there is no 
entry for either “NP” or “nondeterministic polynomial time.” Also missing is an 
index to names, so that it is very difficult to determine where a particular paper is 
cited in the text. Nevertheless, despite these minor problems, the Handbook 
should prove to be a very useful reference for mathematicians and cryptographers. 

The contrast between the Handbook and Shawn Rosenheim’s The Cryptographic 
Imagination couldn’t be more marked. Rosenheim, a professor of English and 
American Studies at Williams College, has produced a rambling, disjointed medi- 
tation on cryptography, Edgar Allan Poe, espionage, Thomas Pynchon, and the 
Internet. 

Unfortunately, Rosenheim’s attempts to discuss technical matters are nearly 
always marked by severe misunderstandings of the mathematics and physics 
involved. For example, consider his definition of quantum cryptography, which 
appears in the glossary: 


A form of cryptography in which, under certain experimental conditions, pairs of photons may be 
created that exert an influence over one another that cannot be explained by quantum 
mechanics. Measuring the polarization of one particle immediately and identically changes the 
spin on its antiparticle. Such polarization takes place regardless of the relative positions of the 
two particles in the universe, in a result that seems to violate the second law of classical theory. 
It is theoretically possible that a stream of such polarized photons could be used to encipher 
messages that could be sent over space in literally no time at all. 


There are so many errors in just these four sentences that it is difficult to know 
where to begin. First of all, the behavior of entangled photon pairs is, contrary to 
the claim, perfectly explicable through quantum mechanics. Second, practical 
quantum cryptography is not currently based on entangled photon pairs—although 
Ekert [7] did propose such a scheme—but a different mechanism proposed much 
earlier by Wiesner [13] and Bennett and Brassard [3]. Third, the reference to the 
“second law” is, of course, utter nonsense. Finally, quantum cryptography is not 
simply a theoretical possibility, but a practical reality [2]. 

Other blunders in The Cryptographic Imagination include conflating monkeys 
and apes, misstating Zipf’s law, wildly overestimating the amount of pornography 
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on the Internet, misstating the name of the Usenet newsgroup alt.sexual.abuse.re - 
covery, and comically misspelling the name of one of the inventors of RSA as 
“Ronald Rivers.” Rosenheim even makes mistakes in his own field: he claims that 
Georges Perec’s book La Vie: Mode d’Emploi was written without the letter “e,” 
when in fact it is another book by Perec entitled La Disparition. 

This is not to say that I didn’t get anything out of Rosenheim’s book. I was 
intrigued to learn about Lizzie Doten, a 19th century mystic who “channeled” 
ersatz poems of Poe and other writers such as Shakespeare and Burns. But the 
book is marred by the usual postmodernist excesses: making much of tenuous 
or nonexistent connections, second-rate wordplay (the series in which The 
Cryptographic Imagination is published is entitled “re-visions of culture and society”; 
among postmodernists, this sort of gratuitous hyphen insertion is apparently 
considered essential), and opaque exposition. Consider the following two 
examples: 


When I claim that Poe helped end World War II, the “Poe” in that sentence represents both a 
particular author and the literary genre he helped create and for which he serves as a 
synecdoche. (p. 15) 


Such a homeopathic technique for the creation of mysteries produces highly cathected readers; 
the surface of the cipher produces a crypt in us, which we proceed to fill with our imagination, 
just as the semantic vacuity of Khumnhotep’s [sic] glyphs contextually signified Khumnhotep’s 
[sic] power and his resistance to comprehension. (p. 48) 


The Cryptographic Imagination will be of little interest to anyone wanting to learn 
about cryptography. In fact, I can scarcely think of a reason to read it, except 
perhaps to see an example of what passes for scholarly work in some academic 
disciplines. 
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Selected books 


Books submitted for review should be sent to Book Reviews Editor, American Mathe- 
matical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield, MN 55057-1098. 


General, P, L**. Calendrical Calculations. 
Nachum Dershowitz, Edward M. Reingold. 
Cambridge Univ Pr, 1997, xxi + 307 pp, 
$22.95 (P); $64.95. [ISBN 0-521-56474-3; 0- 
5621-56413-1] Algorithms for calculations in 
14 calendars of current and historical interest 
(including the Gregorian, Julian, Islamic, He- 
brew, Mayan, Chinese, and Hindu calendars). 
Implementations of the algorithms (in Lisp) are 
given in an appendix. Many interesting notes 
and comments on history of chronology. AO 


General, P. Algebra and Operator Theory. 
Eds: Yusupdjan Khakimdjanov, Michel Goze, 
Shavkat A. Ayupov. Kluwer Academic 1998, 
viii + 250 pp, $114. [ISBN 0-7923-5094-4] 
Proceedings of a 1997 French—Uzbek collo- 
quium in Tashkent, Uzbekistan. 


General, P, L. World Directory of Mathemati- 
cians 1998, 11th Edition. International Math- 
ematical Union. AMS, 1998, xxii + 1093 pp, 
$65 (P). 

Reference, P, L. Handbook of Mathematics 
and Computational Science. John W. Harris, 
Horst Stocker. Springer-Verlag, 1998, xxviii 
+ 1028 pp, $29.95. [ISBN 0-387-94746-9] 
Comprehensive collection of commonly used 
definitions, facts, formulae, tables, techniques, 
etc. Includes numerical as well as analytic tech- 
niques, and a brief introduction to several com- 
puter programming languages. AO 


Education, P*. Research in Collegiate Math- 
ematics Education, Il. Eds: Alan H. Schoen- 
feld, Jim Kaput, Ed Dubinsky. CBMS Issues in 
Math. Educ., V. 7. AMS, 1998, xii + 313 pp, 
$40 (P). [ISBN 0-8218-0882-6] Papers de- 


1999] 


scribe methodology and research on problem 
solving, understanding of concepts and proofs. 


History, P. Paul Dirac: The Man and His Work. 
Abraham Pais, et al. Cambridge Univ Pr, 1998, 
xv + 124 pp, $19.95. [ISBN 0-521-58382-9] 
Lectures about Dirac’s life and work by A. Pais, 
M. Jacob, D.I. Olive, and M.F. Atiyah as well as 
a memorial address given by Stephen Hawking. 


Logic, P. Advances in Modal Logic, Volume 
I. Eds: Marcus Kracht, et al. CSLI Lect. 
Notes, No. 87. CSLI Pub (Center for Study 
of Language & Information, Leland Stanford 
Junior Univ., Stanford, CA 94305), 1998, xvi 
+ 392 pp, $24.95 (P). [ISBN 1-57586-102-X] 
Papers from a 1996 conference at the Free Uni- 
versity of Berlin. Thematic areas: mathematics 
of modal logic; artificial intelligence and know!- 
edge representation; computation; philosophy 
and language; proof theory. 

Logic, P. The Tbilisi Symposium on Logic, 
Language and Computation: Selected Papers. 
Eds: Jonathan Ginzburg, et al. Stud. in Logic, 
Lang. & Inform. CSLI Pub (Center for Study 
of Language & Information, Leland Stanford 
Junior Univ., Stanford, CA 94305), 1998, xxx1 
+ 376 pp, $24.95 (P). [ISBN 1-57586-098- 
8] 23 papers from a 1995 conference in Gu- 
dauri (Republic of Georgia). Topics: natural 
language semantics; dynamic semantics and 
channel theory; theoretical linguistics; compu- 
tational linguistics; formal logic theory; mathe- 
matical logic; theoretical computer science. 


Number Theory, P. The Theory of Partitions. 
George E. Andrews. Math. Lib. Cambridge 
Univ Pr, 1998, xvi + 255 pp, $29.95 (P). [ISBN 
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0-521-63766-X] Republication of the 1976 
Addison-Wesley edition (TR, June-July 1977). 


Group Theory, P. The Atlas of Finite Groups: 
Ten Years On. Eds: Robert Curtis, Robert 
Wilson. London Math. Soc. Lect. Note Ser., 
V. 249. Cambridge Univ Pr, 1998, xvii +293 pp, 
$44.95 (P). [ISBN 0-521-57587-7] 20 arti- 
cles from a 1995 conference in Birmingham, 
England. 


Group Theory, P. Representations and Coho- 
mology, II: Cohomology of Groups and Mod- 
ules. DJ. Benson. Stud. in Adv. Math., 
V. 31. Cambridge Univ Pr, 1998, xi + 279 pp, 
$29.95 (P); $69.95. [ISBN 0-521-63652-3; 0- 
521-36135-4] Paperback edition. 


Ring Theory, P. Representations of Affine Lie 
Algebras. Vyacheslav M. Futorny. Papers in 
Pure & Appl. Math., V. 106. Queen’s Univ, 
1997, xi+ 89 pp, (P). [ISBN 0-88911-756-X] 


Topological Groups, P. Positivity in Lie The- 
ory: Open Problems. Eds: Joachim Hilgert, 
et al. Expos. in Math., V. 26. Walter de 
Gruyter, 1998, xii + 290 pp, $138.95. [ISBN 3- 
11-016112-5] 15 articles provide background 
and pose open problems in a variety of settings. 


Algebra, P. Mixed Motives. Marc Levine. 
Math. Surv. & Mono., V. 57. AMS, 1998, x + 
515 pp, $109. [ISBN 0-8218-0785-4] 


Calculus, T(15—16: 2), P. A Course in Ad- 
vanced Calculus. Robert S. Borden. Dover, 
1998, xii + 402 pp, $12.95 (P). [ISBN 0-486- 
67290-5] Unabridged, slightly corrected re- 
publication of the 1983 North Holland/Elsevier 
Science Publishing Co. edition (TR, March 
1984). Includes a new chapter on “Tips and 
Solutions for Selected Problems.” 


Real Analysis, T(17: 4), L. Modern Anal- 
ysis. Kenneth Kuttler. Stud. in Adv. Math. 
CRC Pr, 1998, 572 pp. [ISBN 0-8493-7166- 
X] Text for a first-year graduate course on 
Lebesgue integration and functional analysis. 
Covers standard topics (i.e., general set the- 
ory and topology, calculus in Banach spaces, 
Hilbert spaces, L? spaces, representation theo- 
rems, Fourier transforms) as well as some less 
standard topics: probability, Bochner integral, 
convex functions. KS 


Differential Equations, P. Advances in Dif- 
ferential Equations and Mathematical Physics. 
Eds: Eric Carlen, Evans M. Harrell, Michael 
Loss. Contemp. Math., V. 217. AMS, 1998, ix 
+ 221 pp, $35 (P). [ISBN 0-8218-0861-3] 14 
papers from the 1997 Georgia Tech-UAB Inter- 
national Conference on Differential Equations 
and Mathematical Physics. 
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Differential Equations, P. Generalized Quasi- 
linearization for Nonlinear Problems. V. Lak- 
shmikantham, A.S. Vatsala. Math. & Its Ap- 
plic., V. 440. Kluwer Academic, 1998, ix + 
276 pp, $140. [ISBN 0-7923-5038-3] 


Partial Differential Equations, T(17—18). 
Theory and Applications of Partial Differential 
Equations. Piero Bassanini, Alan R. Elcrat. 
Math. Concepts & Methods in Sci. & Eng., 
V. 46. Plenum Pr, 1997, ix + 439 pp, $115. 
[ISBN 0-306-45640-0] 


Partial Differential Equations, T(17—18). 
Partial Differential Equations and Boundary 
Value Problems. Viorel Barbu. Math. & Its 
Applic., V. 441. Kluwer Academic, 1998, xii + 
277 pp, $129. [ISBN 0-7923-5056-1] 
Numerical Analysis, S(18), P*, L*. Com- 
putational Integration. Arnold R. Krommer, 
Christoph W. Ueberhuber. SIAM, 1998, xix 
+ 445 pp, $64 (P). [ISBN 0-89871-374- 
9] Modern survey of methods and underlying 
mathematical principles. Gives special atten- 
tion to recent developments (e.g., parallel algo- 
rithms) and to the practical application of theo- 
retical results. Discusses both one- and multi- 
dimensional formulas. AO 


Numerical Analysis, P. Finite Element Anal- 
ysis of Acoustic Scattering. Frank Ihlenburg. 
Appl. Math. Sci., V. 132. Springer-Verlag, 
1998, xiv + 224 pp, $59.95. [ISBN 0-387- 
983 19-8] 


Numerical Analysis, P. Combined Methods 
for Elliptic Equations with Singularities, Inter- 
faces and Infinities. Zi Cai Li. Math. & Its 
Applic., V. 444. Kluwer Academic, 1998, xxiv 
+ 476 pp, $214. [ISBN 0-7923-5084-7] 


Functional Analysis, P. Morita Equivalence 
and Continuous-Trace C*-Algebras. Jain Rae- 
burn, Dana P. Williams. Math. Surv. & Mono., 
V. 60. AMS, 1998, xiv + 327 pp, $65. [ISBN 
0-8218-0860-5] 

Analysis, P. Weyl Transforms. M.W. Wong. 
Universitext. Springer-Verlag, 1998, viii + 
158 pp, $44.95. [ISBN 0-387-984 14-3] 
Analysis, P. Holomorphic Spaces. Eds: Shel- 
don Axler, John E. McCarthy, Donald Sarason. 
Math. Sci. Res. Inst., V. 33. Cambridge Univ 
Pr, 1998, ix + 476 pp, $54.95. [ISBN 0-521- 
63193-9] 16 expository papers presented in 
1995 at MSRI. Topics: Bergman spaces; Han- 
kel operators; the Dirichlet space; subnormal 
operators; operator models, interpolation prob- 
lems, systems theory. 


Analysis, P. Boundary Value Problems for 
Transport Equations. Valeri Agoshkov. Model- 
ing & Simul. in Sci., Eng., & Tech. Birkhiuser 
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Boston, 1998, xvii + 278 pp, $69.95. [ISBN 
0-8 176-3986-1] 

Analysis, P. Stability of Functional Equa- 
tions in Several Variables. Donald H. Hyers, 
George Isac, Themistocles M. Rassias. Prog. in 
Nonlinear Diff. Equat. & Their Applic., V. 34. 
Birkhauser Boston, 1998, vii + 313 pp, $79.50. 
[ISBN 0-8176-4024-X] 


Algebraic Geometry, P. Algebraic Groups 
and Their Birational Invariants. V.E. Voskre- 
senskii. Transl. of Math. Mono., V. 179. AMS, 
1998, xii+ 218 pp, $99. [ISBN 0-8218-0905-9] 


Algebraic Geometry, T(16: 2). Using Alge- 
braic Geometry. David Cox, John Little, Donal 
O’ Shea. Grad. Texts in Math., V. 185. Springer- 
Verlag, 1998, xii + 499 pp, $59.95. [ISBN 
0-387-98487-9] Overview of recent develop- 
ments in modern, computational algebraic ge- 
ometry (using Grdbner bases, resultants, etc.). 
Illustrates applications in combinatorics, local 
rings, algebraic coding theory. RM 


Differential Geometry, T(15-17: 3), P, L. 
Modern Differential Geometry of Curves and 
Surfaces with Mathematica, Second Edition. 
Alfred Gray. CRC Pr, 1998, xxiv + 1053 pp. 
[ISBN 0-8493-7164-3] New chapters on 
global curve theory, space curves, minimal sur- 
faces, inversions, cyclides, the Gauss—Bonnet 
Theorem, and global surface theory. (First Edi- 
tion, Extended Review, December 1995.) 


Differential Geometry, P. Compactifications 
of Symmetric Spaces. Yves Guivarc’h, Lizhen 
Ji, J.C. Taylor. Progress in Math., V. 156. 
Birkhauser Boston, 1998, xi + 284 pp, $54.95. 
[ISBN 0-8176-3899-7] 


Geometry, T(13—-14: 2), L. The Geometric 
Viewpoint: A Survey of Geometries. Thomas 
Q. Sibley. Addison-Wesley, 1998, ix + 309 pp, 
[ISBN 0-201-87450-4]; Instructor’s Resource 
Guide: Including Solutions and Projects for 
the Geometer’s Sketchpad to Accompany, 11 
+ 68 pp. [ISBN 0-201-32550-0] Readable 
introduction to and survey of modern geome- 
try including transformations, symmetry, pro- 
jective, finite, and non-Euclidean geometries. 
Examples and applications nicely motivate the 
development; interesting exercises. RM 


Geometry, P, L. Non-Euclidean Geometry, 
Sixth Edition. H.S.M. Coxeter. MAA, 1998, 
Xviii + 336 pp, $30.95 (P). [ISBN 0-88385- 
522-4] This republication of a classic text in- 
cludes a new section on the concept of inversive 
distance. . 


Algebraic Topology, P. Homotopy Theory 
via Algebraic Geometry and Group Represen- 
tations. Eds: Mark Mahowald, Stewart Priddy. 
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Contemp. Math., V. 220. AMS, 1998, xi + 
379 pp, $74 (P). [ISBN 0-8218-0805-2] Pro- 
ceedings of a 1997 conference at Northwestern 
University. 


Optimization, P. Advances in Nonlinear Pro- 
gramming. Ed: Ya-xiang Yuan. Appl. Optim., 
V. 14. Kluwer Academic, 1998, xiii + 351 pp, 
$149. [ISBN 0-7923-5053-7] Proceedings of 
a 1996 conference in Beijing, China, honoring 
M.J.D. Powell’s 60th birthday. Includes Pow- 
ell’s keynote address, 8 invited papers, and 9 
contributed papers. 


Optimization, P. Optimisation et anal- 
yse convexe. Jean-Baptiste Hiriart-Urruty. 
Presses Universitaires de France, 1998, 376 pp, 
198 FF (P). [ISBN 2-13-048983-4] 

Optimization, P. Quasiconvex Optimization 
and Location Theory. Jaoquim Anténio dos 


Santos Gromicho. Appl. Optim., V. 9. Kluwer 
Academic, 1998, xxi + 218 pp, $109. [ISBN 


0-7923-4694-7] 


Optimal Control, P. Lecture Notes in Control 
and Information Sciences—235: Ho. Control 
and Its Applications. Ben M. Chen. Springer- 
Verlag, 1998, xi + 351 pp, $92 (P). [ISBN 
1-85233-026-0] 

Optimal Control, P. Calculus of Variations 
and Optimal Control. A.A. Milyutin, N.P. Os- 
molovskii. Transl. of Math. Mono., V. 180. 
AMS, 1998, xii + 372 pp, $129. [ISBN 0- 
8218-0753-6] | 


Stochastic Processes, P. Random Fields 
and Stochastic Partial Differential Equations. 
Yu. A. Rozanov. Math. & Its Applic., V. 438. 
Kluwer Academic, 1998, vii + 229 pp, $105. 
[ISBN 0-7923-4984-9] 


Elementary Statistics, T(13: 2), C, L. Mod- 
ern Engineering Statistics. Lawrence L. Lapin. 
Duxbury Pr, 1997, xv + 583 pp, $78.95, with 
disk. [ISBN 0-534-50883-9] Emphasizes ap- 
plications and key statistical concepts. Favors 
insight over rigor; practicality over mathemati- 
cal elegance; simplicity over formalism. Many 
engineering applications. Topics: descriptive 
statistics, data analysis, statistical process con- 
trol, statistical models, probability, sampling 
distributions, statistical estimation, statistical 
testing, regression analysis, analysis of vari- 
ance, and experimental design. Many exercises, 
some answers. KB 


Elementary Statistics, T(13: 2), C.  Ex- 
ploring Statistics: A Modern Introduction to 
Data Analysis and Inference, Second Edition. 
Larry J. Kitchens. Duxbury Pr, 1998, xv + 
940 pp, $73.95, with disk. [ISBN 0-534-26346- 
1] Updated data sets, exercises, and examples 
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(over 1400 exercises and 400 data sets on disk). 
Emphasizes analysis and interpretation of data 
rather than calculations. Actively involves stu- 
dent in the learning process. (First Edition, TR, 
March 1988.) KB 7 


Statistical Methods, P. Additive and Multi- 
plicative Semiparametric Models in Acceler- 
ated Life Testing and Survival Analysis. V. Bag- 
donavicius, M. Nikulin. Papers in Pure & Appl. 
Math., V. 108. Queen’s Univ, 1998, 11+ 109 pp, 
(P). [ISBN 0-8891 1-802-7] 

Statistical Methods, P. Statistical Design and 
Analysis of Experiments. Peter W.M. John. 
Classics in Appl. Math., V. 22. SIAM, 1998, 
xxiv + 356 pp, $40 (P). [ISBN 0-89871-427-3] 
Unabridged republication of the 1971 Macmil- 
lan Publishing Co. edition (TR, May 1971). 
New Preface describes changes in the field since 
the book was first published. 


Statistical Methods, T(13, 17: 2), P, L. A 
First Course in Multivariate Statistics. Bernard 
Flury. Texts in Stat. Springer-Verlag, 1997, 
xv + 713 pp, $79.95. [ISBN 0-387-98206- 
X] Unified treatment of theoretical and prac- 
tical aspects of multivariate statistics. Assumes 
knowledge of basic statistics and matrix alge- 
bra. Topics: multivariate normal distribution, 
parameter estimation, statistical inference for 
means, classification, multivariate analysis of 
variance, logistic regression, principal compo- 
nents, and normal mixtures. KB 
Mathematical Computing, S, P, L. The MAT- 
LAB 5 Handbook. Darren Redfern, Colin 
‘Campbell. Springer-Verlag, 1998, xi + 488 pp, 
$34.95 (P). [ISBN 0-387-94200-9] For ref- 
erence rather than tutorial use. Organized by 
problem category (e.g., linear equations, ordi- 
nary differential equations, animation). Entries 
give command name, parameters, hints, and 
cross-references. 


Mathematical Computing, S, P. The Mathe- 
matica Primer. Kevin R. Coombes, et al. Cam- 
bridge Univ Pr, 1998, xvii + 214 pp, $24.95 (P); 
$64.95. [ISBN 0-521-63715-5; 0-521-63130- 
0] A brief introduction to Mathematica Ver- 
sion 3 for novice users. AO 


Applications (Economics), P, L. Methods 
of Mathematical Finance. Ioannis Karatzas, 
Steven E. Shreve. Applic. of Math., V. 39. 
Springer-Verlag, 1998, xv + 407 pp, $69.95. 
[ISBN 0-387-94839-2] Sets up Brownian 
model for financial markets; treats pricing and 
hedging contingent claims; addresses problems 
faced by agents (one or several) for optimal 
consumption and investment decisions. Reader 
must be familiar with probability and stochastic 
processes. KS 
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Applications (Fluid Mechanics), P. Weakly 
Nonlocal Solitary Waves and _ Beyond-All- 
Orders Asymptotics: Generalized Solitons and 
Hyperasymptotic Perturbation Theory. John P. 
Boyd. Math. & Its Applic., V. 442. Kluwer 
Academic, 1998, xix + 590 pp, $198. [ISBN 
0-7923-5072-3] | 


Applications (Mechanics), P. Computational 
Inelasticity. J.C. Simo, T.J.R. Hughes. Inter- 
disc. Appl. Math., V. 7. Springer-Verlag, 1998, 
xiv + 392 pp, $59.95. [ISBN 0-387-97520-9] 


Applications (Mechanics), P. Generalized 
Analytic Functions: Theory and Applications 
to Mechanics. Eds: Helmut Florian, et al. In- 
tern. Soc. for Analysis, Applic. & Comput., 
V. 1. Kluwer Academic, 1998, xxvi + 311 pp, 
$140. [ISBN 0-7923-5043-X] Proceedings of 
a 1997 conference at the Technical University 
of Graz. First part treats generalized analytic 
functions in the complex plane and in higher di- 
mensions. Second part focuses on applications 
to mechanics. 


Applications (Physics), T(17: 1). The Geom- 
etry of Physics: An Introduction. Theodore 
Frankel. Cambridge Univ Pr, 1997, xxii + 
654 pp, $95. [ISBN 0-521-38334-X] _ Intro- 
duction to differential geometry and topology, 
Lie groups, bundles and forms, etc., for modern 
and classical physics. Good, intuitive develop- 
ment of the fundamental concepts by discussion 
of surfaces in space. RM 


Applications (Systems Theory), P. Lecture 
Notes in Control and Information Sciences— 
234: Neural Networks in Multidimensional 
Domains: Fundamentals and New Trends in 
Modelling and Control. Paolo Arena, et al. 
Springer-Verlag, 1998, xv + 165 pp, $59 (P). 
[ISBN 1-85233-006-6] 


Applications (Systems Theory), P. Lecture 
Notes in Control and Information Sciences— 
232: Experimental Robotics V. Eds: Alicia 
Casals, Anibal T. de Almeida. Springer-Verlag, 
1998, xix + 700 pp, $129 (P). [ISBN 3-540- 
76218-3] Proceedings of a 1997 conference 
in Barcelona, Spain. 


Applications, P. Optical Pattern Recognition. 
Eds: Francis T.S. Yu, Suganda Jutamulia. Cam- 
bridge Univ Pr, 1998, xv + 440 pp, $105. [ISBN 
0-521-46517-6] 15 chapters, each written by 
a specialist, provide an overview of theoretical 
and practical aspects of the subject. 


Reviewers 
KB: Karla Ballman, Macalester; RM: Richard Molnar, 


Macalester; AO: Arnold Ostebee, St. Olaf; KS: Karen 
Saxe, Macalester. 
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Beliefs-Preferences Gauge Symmetry Group and Replication 
of Contingent Claims ind General Market Environment 


This book studies the actual financial phenomena underlying the evaluation of 
financial derivatives, which is today virtually identified with and even replaced 
by the study of the mathematical aspects of stochastic calculus as a model for 
such phenomena. It expresses the view that the study of financial phenomena 

is on the brink of a revolution similar to that of quantum physics in the 
1920s. A fundamental symmetry, a gauge symmetry, is introduced between 
beliefs of market participants and their preferences in a general market 
environment for a market with exchange of an arbitrary number of 
arbitrary underlying securities. The practical applications of this gauge 
symmetry range from the detection of a new type of true arbitrage to 
the beliefs-preferences-independent valuation and dynamic replication 


of contingent claims in a general market environment. 


To Order Call: 1-888-437-4979 * Special Introductory Price $149.00 
ISBN 0-9663032-1-0, Hardcover, 460 Pages. * Additional Discount Available for Full-time Students 
IES Press * P.O. Box 14008 © Research Triangle Park, NC + 27709 
For Other IES Press Publications, See Web Site: www.iesic.com 


THE MATHEMATICAL ASSOCIATION OF AMERICA ‘> 


Julia a life in mathematics 


Constance Reid 


Constance Reid, an established writer about mathematicians, bas 
written an excellent and loving book, about her sister Julia 
Robinson, the mathematician. The author has written that she 
wanis the book to be one for all age groups and she has succeeded 
admirably in making it so..Julia wanted to be known as a mathe- 
matician, not a woman mathematician and rightly so! However, 
she was, and ts, a wonderful role model for women aspiring to be 
mathematician. What a great gift this book would be! 

—Alice Schafer, Former President, AWM 


This book is a small treasure, one which I want to share with all 
my mathematical friends. The assembly of several articles and 
additional photos and remarks provides the image of a mathe- 
matician of extraordinary taste, tenacity and generosity... Julia 
Robinson broke ground in displaying the deep connections 
between number theory and logic. Her results have led to a 
very active area today, making the appearance of this book very 
timely. Her work and her example are however timeless and I 
can think of no better advice to give a young mathematician, 
either in bow to do mathematics. or how to bebave in mathe- 
matics, than: “Be like Julia!” 

—Carol Wood, Deputy Director, MSRI 


Phone in Your Order Now! & 1-800-331-1622 


Julia is the story of the life of Julia Bowman Robinson, the gift- 
ed and highly original mathematician who during her lifetime 
was recognized in ways that no other woman mathematician 
had been recognized up to that time. In 1976 she became the 
first woman mathematician elected to the National Academy of 
Sciences and in 1983 the first woman elected president of the 
American Mathematical Society. 

This unusual book, profusely illustrated with previously 
unpublished personal and mathematical memorabilia, brings 
together in one volume the prizewinning “Autobiography of 
Julia Robinson” by her sister, the popular mathematical 
biographer Constance Reid, and three very personal articles 
about her work by outstanding mathematical colleagues. 


All royalties from sales of this book will go to fund a Julia 
Robinson Prize in Mathematics at the high school from 
which she graduated. 


Catalog Code: JULIA/JR 
136 pp., Hardbound, 1996, ISBN 0-88385-520-8 
List: $27.00 MAA Member: $20.00 


Prentice Hall Advanced Mathematics 


New Titles 


Introduction. to Analysis 
Arthur Mattuck, MIT 


Multivariable Calculus with Vectors 
Hartley Rogers, Jr., MIT 


Differential Equations: 
Modeling with MATLAB® 


Paul Davis, Worchester Polytechnic Institute 
Cleve Moler 


Discrete Mathematics, Fourth Edition 


Ken Ross and Charles Wright, both of the 
University of Oregon 


Abstract Algebra, Second Edition 


David Dummit and Richard Foote, both of the 
University of Vermont 


Numerical Methods Using 
MATLAB®, Third Edition 


John Mathews, California State University 
at Fullerton 
Kurtis Fink, Northwest State University 


A Contextual History of Mathematics 
Ronald Calinger, Catholic University of America 


Introduction to Mathematical 
Programming 
Russell Walker, Carnegie-Mellon University 


Differential Equations and Linear 
Algebra, Second Edition 


Stephen Goode, California State University 
at Fullerton 


Calculus with Early Vectors 


Vector Calculus, Linear Algebra, 
and Differential Form: A Unified 
Approach 


John Hubbard, Cornell University 
Barbara Hubbard 


Advanced Calculus: A Friendly 
Approach 


Witold Kosmala, Appalachian State University 


Introductory Combinatorics, 
Third Edition 


Richard Brualdi, University of Wisconsin 


Foundations of Plane Geometry 


Harvey Blau, Northern Illinois University 
John Wetzel, University of Illinois, Urbana 


Introduction to Topology 


Dennis Roseman, University of lowa 


Partial Differential Equations: 
Sources and Solutions 


Arthur David Snider, University of South Florida 


Principles of Mathematical Problem 
Solving 


Martin Erickson and Joe Flowers, both of Truman 
State University 


A First Course in Fourier Analysis 
David Kammler, Southern Illinois University 


by Phillip Zenor, Edward Slaminka, and Donald Thaxton, all of Auburn University 


Starting with an introduction to vectors in Chapter | and integrating this topic (and differential equations) 
throughout, this text is written for students taking a concurrent calculus-based physics course. Physics and 


engineering applications receive extra emphasis. 


Visit us on the Web at www.prenhall.com 


UK 


Packaged with a PC compatible disk that enhances the 
material in the text. 


Suitable for classroom adoption in an innovative course for 
¢ a general education mathematics elective 
e amathematics or science major advanced elective 
¢ an interdisciplinary course, even at a relatively 
elementary level | 
¢ amathematical modeling course in a 
civil/environmental engineering program 


This book has a dual objective: first, to introduce the reader 
to some of the most important and widespread environmen- 
tal issues of the day; and second, to illustrate the vital role 
played by mathematical models in investigating these issues. 
The environmental issues addressed include: ground-water 
contamination, air pollution, and hazardous material emer- 
gencies. These issues are presented in their full real-world 
context, not as scientific or mathematical abstractions; and 
for background, readers are invited to investigate their status 
in their own communities. 

The first part of the book leads the reader through rela- 
tively elementary modeling of these phenomena, including 
simple algebraic equations for ground water, slightly more 
complex algebraic equations (preferably implemented on a 
spreadsheet or other computerized framework) for air pollu- 
tion, and a fully computerized modeling package for haz- 
ardous materials incident analysis. The interplay between 
physical intuition and mathematical analysis is emphasized. 

For more advanced readers, the second part of the book 
returns to the same three subjects but with a higher level of 
mathematical sophistication (adjustable to the preparation of 


the reader by selection of subsections.) Many important clas- 


sical mathematical themes are developed through this con- 
text, examples coming from single and multivariable calcu- 
lus, differential equations, numerical analysis, linear algebra 
and probability. The material is presented in such a way as to 
minimize the required background and to encourage the 
subsequent study of some of these fields. 

An elementary course for a general audience could be 
based entirely on Part I, and a higher level mathematics, sci- 


Mathematical 
Modeling 
in the Environment 


Charles Hadlock 


Series: Classroom Resource Materials 


ence, or engineering course could move quickly to Part 2. 

A PC compatible diskette packaged with the text con- 
tains a spreadsheet program that facilitates the numerical 
experimentation with the Gaussian plume equation intro- 
duced in Chapter 3, as well as public domain DOS program 
(ARCHIE) for evaluating the consequences from various haz- 
ardous materials scenarios (e.g., the physical extent of flam- 
mable and toxic vapor clouds). Text is not tied to the use of 
this software, but it is included as an aid to meet the peda- 
gogical objectives of the text. 


Catalog Code: ENV/SA 
312 pp., Paperbound, 1998, ISBN 0-88385-709-X 
List: $55.00 MAA Member: $43.95 


Instructor’s and Solutions 
Manual for Mathematical 
Modeling in the Environment 


Charles Hadlock 


Contains the complete solutions and further discussion of 
nearly every exercise presented in the textbook. This includes 
both the mathematical/computational exercises as well as the 
research questions and investigations. Readers will benefit 
greatly from perusing solutions to the problems whether they 
have worked them out themselves or not. Students using this 
volume will still need to work out solutions of research ques- 
tions using their own sources and adapting them to their own 
geographic locations, or using their own computational 
schemes, so this volume could well be useful for students in 
many course contexts. Enrichment material is included on 
the topics of some of the exercises. Advice for teachers who 
lack previous environmental experience, but who want to 
teach this material is also provided and makes it practical for 
such persons to offer a course based on these volumes. 


Catalog Code: EVS/SA 
150 pp., Paperbound, 1998, ISBN 0-88385-713-8 
List: $18.95 MAA Member: $14.95 


Phone in Your Order Now! @ 1-800-331-1622 


Without question, Leonhard Euler (1707-1783) 
ranks among history's greatest mathematicians. 
Across six decades of unmatched productivity, and 
despite a visual impairment that grew ever worse, 
he charted the course of mathematics throughout 
the eighteenth century and beyond. His reputation 
is captured in Laplace’s famous admonition, “Read 
Euler, read Euler. He is the master of us all.” 


Written for the mathematically literate reader, this 
book provides a glimpse of Euler in action. 
Following an introductory biographical sketch are 
chapters describing his contributions to eight differ- 
ent topics—number theory, logarithms, infinite 
series, analytic number theory, complex variables, 


algebra, geometry, and combinatorics. Each chapter 


begins with a prologue to establish the historical 
context and then proceeds to a detailed considera- 
tion of one or more Eulerian theorems on the sub- 
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ject at hand. Each chapter concludes with an epi- 
logue surveying subsequent developments or 
addressing related questions that remain unan- 
swered to this day, At the end of the book is a brief 
outline of Euler’s collected works, the monumental 
Opera Omnia, whose publication has consumed vir- 
tually all of the twentieth century. 


In all, the book contains three dozen proofs from 
this remarkable individual. Yet this is merely the tip 
of the scholarly iceberg, for Euler produced over 
30,000 pages of pure and applied mathematics dur- 
ing his lifetime. Euler: The Master of Us All samples 
the work of a mathematician whose influence, 
industry, and ingenuity are of the very highest order. 
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192 pp., Paperbound, ISBN- 0-88385-328-0 
List: $29.95 MAA Member: $23.95 
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Rings and Things and a Fine Array of 
Twentieth Century Associative Algebra 


Carl Faith, Professor Emeritus, Rutgers University, 
New Brunswick, NJ 


This book surveys more than 125 years of aspects of 
associative algebras, especially ring and module 
theory. It is the first to probe so extensively such a 
wealth of historical development. Moreover, the author 
brings the reader up to date, in particular through his 
report on the subject in the second half of the twenti- 
eth century. 


Included in the book are certain categorical 
properties from theorems of Frobenius and 
Stickelberger on the primary decomposition of 
finite Abelian groups; Hilbert's basis theorem and 
his Nullstellensatz, including the modern formula- 
i tions of the latter by Krull, Goldman, and others; 
Maschke’s theorem on the representation theory 
of finite groups over a field; and the fundamental 
theorems of Wedderburn on the structure of finite 
dimensional algebras and finite skew fields and 
their extensions by Braver, Kaplansky, Chevalley, 
Goldie, and others. A special feature of the book is the 
in-depth study of rings with chain condition on annihilator 
ideals pioneered by Noether, Artin, and Jacobson and 
refined and extended by many later mathematicians. 


Two of the author's prior works, Algebra: Rings, Modules 
and Categories, | and |! (Springer-Verlag, 1973), are 
devoted to the development of modern associative algebra 
and ring and module theory. Those works serve as a foun- 
dation for the present survey, which includes a bibliography 
of over 1,600 references and is exhaustively indexed. 


In addition to the mathematical survey, the author gives 
candid and descriptive impressions of the last half of the 
twentieth century in “Part Il: Snapshots of Some Mathe- 
matical Friends and Places”. Beginning with his teachers 
and fellow graduate students at the University of Ken- 
tucky and at Purdue, Faith discusses his Fulbright-Nato 
Postdoctoral at Heidelberg and at the Institute for 
Advanced Study (IAS) at Princeton, his year as a visiting 
- scholar at Berkeley, and the many acquaintances he met 
there and in subsequent travels in India, Europe, and 
most recently, Barcelona. 


Comments on the book: 


Researchers in algebra should find it both enjoyable to 
read and very useful in their work. In all cases, [Faith] 
cites full references as to the origin and development of 
the theorem .... | know of no other work in print which 
does this as thoroughly and as broadly. 


“Part Il: Snapshots of Some Mathematical Friends and 
Places” is wondertull [It is] a joy to read! Mathematicians 
of my age and younger will relish reading “Snapshots”. 
—James A. Huckaba, 


Mathematical Surveys and Monographs, Volume 65; 
1999; 420 pages; Hardcover; ISBN 0-8218-0993-8; List $99; 
Individual member $59; Order code SURV/65MM91 
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AMERICAN MATHEMATICAL SOCIETY 


Prospects in Mathematics 


Invited Talks on the Occasion of the 
250th Anniversary of Princeton 
University 


Hugo Rossi, Mathematical Sciences Research 
Institute, Berkeley, CA, Editor 


In celebration of Princeton University’s 250th anniver- 
sary, the mathematics department held a conference 
entitled “Prospects in Mathematics”. The purpose of the 
conference was to speculate on future directions of 
research in mathematics. 


This collection of articles provides a rich panorama of 
current mathematical activity in many research areas. 
From Gromov's lecture on quantitative differential topol- 
ogy to Witten’s discussion of string theory, new ideas 
and techniques transfixed the audience of international 
mathematicians. The volume contains 11 articles by 
leading mathematicians, including historical presenta- 
tions by J. Milnor and D. Spencer. It provides a guide to 
some of the most significant mathematical work of the 
past decade. 


Cover picture of Old Fine Hall at Princeton University is courtesy of 
Robert P. Matthews, Communications Department, Princeton University. 


1999; 154 pages; Hardcover; ISBN 0-8218-0975-X; List $29; 
All AMS members $23; Order code PIMMM91 


Edward Witten, /nstitute for 
Advanced Study, Princeton, NJ 


The problem of unifying quantum mechanics and gravity 
in a single coherent theory represents an enormous | 
obstacle to full understanding of the forces of nature. 
The mysterious M-theory has emerged as a likely candi- 
date for such a unifying theory. Whether the “M” stands 
for marvel or matrix, magic or membrane, it is clear that 
this area of research is among the most exciting and 
most profound in all of science today. Edward Witten, 
one of the world’s boldest innovators in this field, pro- 
vides insights into these extraordinary developments in 
a completely expository presentation. Students and 
researchers specializing in mathematics and physics will 
find this lecture especially appealing. However, because 
it is completely nontechnical, large parts of it can easily 
be appreciated by viewers with little or no scientific or 
mathematical training. | 

1998; NTSC format on one-half inch VHS videotape, 
approximately 60 minutes; ISBN 0-8218-1350-1; List $54.95; 
Individual member $34.95; Order code VIDEO/101MM91 
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FERMAT’S LAST 
THEOREM FOR AMATEURS 


In 1995, Andrew Wiles completed a proof of 
Fermat’s last theorem. Although this was certainly 
a great mathematical feat, one shouldn’t dismiss 
earlier attempts made by clever amateurs and 
famous mathematicians alike to solve the problems. 
In this book, aimed at amateurs curious about the 
unfolding of the subject, the author restricts his 
attention exclusively to elementary methods 
which may only have led to partial solutions, but 
their interest goes beyond Fermat’s problem. 


1999/APP. 376 PP./HARDCOVER/$39.95/ISBN 0-387-98508-5 


E. KAMERICH, Catholic University of Nijmegen, 
The Netherlands 


A GUIDE TO MAPLE® 


This “hands-on” book is for people who are 
interested in immediately putting Maple to work. 
The reader is provided with a compact, fast, and 
surveyable guide that introduces them to the 
extensive capabilities of the software. The book 

is sufficient for standard use of Maple and will 
provide techniques for extending Maple for more 
specialized work. The author discusses the reliability 
of results systematically and presents ways of 
testing questionable results. The book allows 

a reader to become a user almost immediately 

and helps him/her to grow gradually to a broader 
and more proficient use. As a consequence, some 
subjects are dealt with in an introductory way 
early in the book, with references to a more detailed 
discussion later on. 


1998/352 PP., 41 ILLUS. /HARDCOVER/$39.95 
ISBN 0-387-94116-9 


JOE MAZUR, Marlboro College, VT 
EXPLORATIONS IN CALCULUS 


Explorations in Calculus is a completely self- 
contained, cross platform CD tutorial package 

and electronic study guide for students taking 
calculus at the college or high school level. 
Among the many features of the CD are: sounds 
and animations, text files, examples and exercises, 
a drawing program, progress checks, feedback, 
hints to problem solving, and cut and paste and 
notepad capabilities. 

1999/APP. 24 PP., 68 ILLUS./$37.95 (TENT.) 


ISBN 0-387-14249-5 
TEXTS IN MATHEMATICAL SCIENCES 


Now available bundled with Student Minitab! 


ALLAN ROSSMAN, Dickinson College, Carlisle, PA and 
BETH L. CHANCE, University of the Pacific, Stockton, CA 


WORKSHOP STATISTICS 
Student Minitab Version 


1999/152 PP./SOFTCOVER/$59.95 (TENT.) 
ISBN 0-387-91580-X 


HEINZ-OTTO PEITGEN and HARTMUT JURGENS, both 
of University of Bremen, Germany, DIETMAR SAUPE, 
University of Freiburg, Germany, EVAN MALETSKY, 
Montclair State University, NJ and TERRY PERCIANTE, 
Wheaton College, IL 


FRACTALS FOR THE CLASSROOM 
STRATEGIC ACTIVITIES 


Volume 3 


This third and final 
volume of Strategic 
Activities focuses on 
fractal images and the 
mechanisms by which 
they are formed. The 
general pattern and 
specific steps used to 
construct a fractal image 
illustrated throughout 
this volume comprise 
an iterated function system. The objective of this 
volume is to investigate the processes and often 
surprising results of applying such systems. The 
contents of this volume joined with the details 
contained in the prior two books provide a compre- 
hensive survey of fractal geometry and chaos 
theory. In addition to the notions of this new and 
emerging discipline, the dynamic nature of the 
research and the experimental characteristics of 
related applications provide an engaging paradigm 
for classroom activity. 


Contents: Connections to the Curriculum 

¢ Foreword (by Jana Wallace) ¢ IFS in Two 
Dimensions ¢ IFS and Geometric Genetic Codes 
e¢ Answers 


1998/APP. 121 PP., 285 ILLUS./SOFTCOVER/$24.95 (TENT.) 
ISBN 0-387-98420-8 
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